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The Effects of Psychotherapy: An Evaluation 
H. J. Eysenck 


Institute of Psychiatry, Maudsley Hospital 
University of London 


The recommendation of the Committee on 
Training in Clinical Psychology of the Ameri- 
can Psychological Association regarding the 
training of clinical psychologists in the field of 
psychotherapy has been criticized by the writer 
in a series of papers [10, 11, 12]. Of the 
arguments presented in favor of the policy ad- 
vocated by the Committee, the most cogent one 
is perhaps that which refers to the social need 
for the skills possessed by the psychotherapist. 
In view of the importance of the issues in- 
volved, it seemed worth while to examine the 
evidence relating to the actual effects of psy- 
chotherapy, in an attempt to seek clarification 
on a point of fact. 


Base Line and Unit of Measurement 


In the only previous attempt to carry out 
such an evaluation, Landis has pointed out that 
“before any sort of measurement can be made, 
it is necessary to establish a base line and a 
common unit of measure. The only unit of 
measure available is the report made by the 
physician stating that the patient has recovered, 
is much improved, is improved or unimproved. 
This unit is probably as satisfactory as any type 
of human subjective judgment, partaking of 
both the good and bad points of such judg- 
ments” [26, p. 156.] For a unit Landis 
suggests “that of expressing therapeutic re- 
sults in terms of the number of patients re- 
covered or improved per 100 cases admitted to 
the hospital.” As an alternative, he suggests 
“the statement of therapeutic outcome for some 
given group of patients during some stated in- 
terval of time.” 


Landis realized quite clearly that in order to . 


evaluate the effectiveness of any form of 
therapy, data from a control group of nontreat- 
ed patients would be required in order to com- 
pare the effects of therapy with the spontaneous 
remission rate. In the absence of anything bet- 


ter, he used the amelioration rate in state 
mental hospitals for patients diagnosed under 
the heading of “neuroses.” As he points out: 


There are several objections to the use of the 
consolidated amelioration rate . . of the . 
state hospitals ... as a base rate for spontaneous 
recovery. The fact that psychoneurotic cases are 
not usually committed to state hospitals unless in a 
very bad condition; the relatively small number 
of voluntary patients in the group; the fact that 
such patients do get some degree of psychotherapy 
especially in the reception hospitals; and the prob- 
ably quite different economic, educational, and so- 
cial status of the State Hospital group compared to 
the patients reported from each of the other hospi- 
tals—all argue against the acceptance of [this] fig- 
ure ... as a truly satisfactory base line, but in the 
absence of any other better figure this must serve 
[26, p. 168]. 


Actually the various figures quoted by Land- 
is agree very well. The percentage of neurotic 
patients discharged annually as recovered or 
improved from New York state hospitals is 70 
(for the years 1925-1934); for the United 
States as a whole it is 68 (for the years 1926 
to 1933). The percentage of neurotics dis- 
charged as recovered or improved within one 
year of admission is 66 for the United States 
(1933) and 68 for New York (1914). The 
consolidated amelioration rate of New York 
state hospitals, 1917-1934, is 72 per cent. As 
this is the figure chosen by Landis, we may ac- 
cept it in preference to the other very similar 
ones quoted. By and large, we may thus say 
that of severe neurotics receiving in the main 
custodial care, and very little if any psycho- 
therapy, over two-thirds recovered or improved 
to a considerable extent. “Although this is not, 
strictly speaking, a basic figure for ‘spontane- 
ous’ recovery, still any therapeutic method 
must show an appreciably greater size than this 
to be seriously considered” [26, p. 160]. 

Another estimate of the required “base 
line” is provided by Denker: 
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Five hundred consecutive disability claims due to 
psychoneurosis, treated by general practitioners 
throughout the country, and not by accredited spe- 
cialists or sanatoria, were reviewed. All types of 
neurosis were included, and no attempt made to 
differentiate the neurasthenic, anxiety, compulsive, 
hysteric, or other states, but the greatest care was 
taken to eliminate the true psychotic or organic 
lesions which in the early stages of illness so often 
simulate These cases were taken consec- 
utively from the files of the Equitable Life Assur- 
ance Society of the United States, were from all 
parts of the country, and all had been ill of a neu- 
rosis for at least three months before claims were 
submitted. They, therefore, could be fairly called 
“severe,” since they had been totally disabled for at 
least a three months’ period, and rendered unable to 


‘ 


neurosis. 


carry on with any “occupation for remuneration or 


profit” for at least that time [9, p. 2164]. 

These patients were regularly seen and 
treated by their own physicians with sedatives, 
tonics, suggestion, and reassurance, but in no 
case was any attempt made at anything but 
this most superficial type of “psychotherapy” 
which has always been the stock-in-trade of the 
general practitioner. Repeated statements, 
every three months or so by their physicians, 
as well as independent investigations by the in- 
surance company, confirmed the fact that these 
people actually were not engaged in produc- 
tive work during the period of their illness. 
During their disablement, these cases received 
disability benefits. As Denker points out, “It 
is appreciated that this fact of disability income 
may have actually prolonged the total period 
of disability and acted as a barrier to incentive 
for recovery. One would, therefore, not expect 
the therapeutic results in such a group of cases 
to be as favorable as in other groups where the 
economic factor might act as an important spur 
in helping the sick patient adjust to his neurocic 
conflict and illness” [9, p. 2165]. 


The cases were all followed up for at least 
a five-year period, and often as long as ten 
years after the period of disability had begun. 
The criteria of “recovery” used by Denker 
were as follows: (a) return to work, and 
ability to carry on well in economic adjust- 
ments for at least a five-year period; (4) com- 
plaint of no further or very slight difficulties; 
(c)making of successful social adjustments. 
Using these criteria, which are very similar to 
those usually used by psychiatrists, Denker 
found that 45 per cent of the patients recovered 
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after one year, another 27 per cent after two 
years, making 72 per cent in all. Another 10 
per cent, 5 per cent, and 4 per cent recovered 
during the third, fourth, and fifth years, re- 
spectively, making a total of 90 per cent re- 
coveries after five years. 

This sample contrasts in many ways with 
that used by Landis. The cases on which 
Denker reports were probably not quite as 
severe as those summarized by Landis; they 
were all voluntary, nonhospitalized patients, 
and came from a much higher socioeconomic 
stratum. The majority of Denker’s patients 
were clerical workers, executives, teachers, and 
professional men. In spite of these differences, 
the recovery figures for the two samples are 
almost identical. The most suitable figure to 
choose from those given by Denker is probably 
that for the two-year recovery rate, as follow- 
up studies seldom go beyond two years and 
the higher figures for three-, four-, and five- 
year follow-up would overestimate the eff- 
ciency of this “base line” procedure. Using, 
therefore, the two-year recovery figure of 72 
per cent, we find that Denker’s figure agrees 
exactly with that given by Landis. We may, 
therefore, conclude with some confidence that 
our estimate of some two-thirds of severe 
neurotics showing recovery or considerable im- 
provement without the benefit of systematic 
psychotherapy is not likely to be very far out. 

Effects of Psychotherapy 

We may now turn to the effects of psycho- 
therapeutic treatment. The results of nineteen 
studies reported in the literature, covering over 
seven thousand cases, and dealing with both 
psychoanalytic and eclectic types of treatment, 
are quoted in detail in Table 1. An attempt 
has been made to report results under the four 
headings: (a) Cured, or much improved; (+4) 
Improved; (c) Slightly improved; (d) Not 
improved, died, discontinued treatment, etc. 
It was usually easy to reduce additional cate- 
gories given by some writers to these basic 
four ; some writers give only two or three cate- 
gories, and in those cases it was, of course, im- 
possible to subdivide further, and the figures 
for combined categories are given. A slight 

1In one or two cases where patients who im- 
proved or improved slightly were combined by the 


original author, the total figure has been divided 
equally between the two categories. 
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Table 1 
Summary of Reports of the Results of Psychotherapy 
_ — " ‘dame ine —- a Not Jo 
Cured; im- Cured; 
much Im- Slightly proved; much 
N im- proved im- died ; im- 
proved proved left proved; 
treat- im- 
ment proved 
(A) Psychoanalytic 
1. Fenichel [13, pp. 28-40] .. 484 104 &4 99 197 39 
2. Kessel & Hyman [ 24]}.......... 34 16 5 4 9 62 
3. Jones [ 22, pp. 12-14] 59 20 % 28 3 47 
4. Alexander [1, pp. 30-43]... 141 28 42 23 48 50 
eS | & 20 7 7 67 
All cases 760 335 425 44% 
(B) Eclectic 
1. Huddleson [20] nw 200 19 74 80 27 46 
| : 775 10 310 310 145 41 
3. Maudsley Hospital 
Report (1931) 1721 288 900 533 69 
4. Maudsley Hospital 
et | i Fy 371 765 575 64 
5. Neustatter [32] sdonduaiceniie 46 9 14 8 15 50 
6. Luff & Garrod [27] 500 140 135 26 199 55 
7. Luff & Garrod [27] : . 210 38 84 54 34 68 
8. Ross [ 34] 1089 547 306 236 77 
9. Yaskin [40] ee 29 29 42 58 
10. Curran [7] : ; 83 51 32 61 
11. Masserman & 

Carmichael [29]......... 50 7 20 5 18 54 
12. Carmichael & 

Masserman [4]............. 77 16 25 14 22 53 
AS; Detetidor 095 }a ca nscncsccnic 35 11 11 6 7 63 
14. Hamilton & Wall [16]... 100 32 34 17 17 66 
15. Hamilton et al. [15]-....... 100 48 5 17 32 51 
16. Landis [26] = 119 40 47 32 73 
17. Institute Med. Psychol. 

(quoted Neustatter)........ 270 58 132 55 25 70 
a) ae 54 3 24 16 11 50 
eS A 6) ee 53 13 18 13 9 58 

Ss 7293 4661 





degree of subjectivity inevitably enters into this 
procedure, but it is doubtful if it has caused 
much distortion. A somewhat greater degree 
of subjectivity is probably implied in the 
writer’s judgment as to which disorders and 
diagnoses should be considered to fall under 
the heading of “neurosis.” Schizophrenic, 
manic-depressive, and paranoid states have 
been excluded; organ neuroses, psychopathic 
states, and character disturbances have been in- 
cluded. The number of cases where there was 


64% 
gen.sine doubt is probably too small to make 
much change in the final figures, regardless of 
how they are allocated. 

A number of studies have been excluded be- 
cause of such factors as excessive inadequacy 
of follow-up, partial duplication of cases with 
others included in our table, failure to indicate 
type of treatment used, and other reasons 
which made the results useless from our point 
of view. Papers thus rejected are those by 
Thorley & Craske [37], Bennett and Semrad 
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[2], H. I. Harris [19], Hardcastle [17], A. 
Harris [18], Jacobson and Wright [21], 
Friess and Nelson [14], Comroe [5], Wenger 
[38], Orbison [33], Coon and Raymond [6], 
Denker [8], and Bond and Braceland [3]. 
Their inclusion would not have altered our 
conclusions to any considerable degree, al- 
though, as Miles e¢ al. point out: “When the 
various studies are compared in terms of 
thoroughness, careful planning, strictness of 
criteria and objectivity, there is often an in- 
verse correlation between these factors and the 
percentage of successful results reported” [31, 
p. 88]. 

Certain difficulties have arisen from the in- 
ability of some writers to make their column 
figures agree with their totals, or to calculate 
percentages accurately. Again, the writer has 
exercised his judgment as to which figures to 
accept. In certain cases, writers have given 
figures of cases where there was a recurrence 
of the disorder after apparent cure or im- 
provement, without indicating how many pa- 
tients were affected in these two groups re- 
spectively. All recurrences of this kind have 
been subtracted from the “cured” and “im- 
proved” totals, taking half from each. The 
total number of cases involved in all these ad- 
justments is quite small. Another investigator 
making all decisions exactly in the opposite 
direction to the present writer’s would hardly 
alter the final percentage figures by more than 
1 or 2 per cent. 


We may now turn to the figures as present- 
ed. Patients treated by means of psychoanaly- 
sis improve to the extent of 44 per cent; pa- 
tients treated eclectically improve to the extent 
of 64 per cent ; patients treated only custodially 
or by general practitioners improve to the ex- 
tent of 72 per cent. There thus appears to be 
an inverse correlation between recovery and 
psychotherapy; the more psychotherapy, the 
smaller the recovery rate. This conclusion re- 
quires certain qualifications. 

In our tabulation of psychoanalytic results, 
we have classed those who stopped treatment 
together with those not improved. This ap- 
pears to be reasonable; a patient who fails to 
finish his treatment, and is not improved, is 
surely a therapeutic failure. The same rule 
has been followed with the data summarized 
under “eclectic” treatment, except when the 


patient who did not finish treatment was defi- 
nitely classified as “improved” by the therapist. 
However, in view of the peculiarities of 
Freudian procedures it may appear to some 
readers to be more just to class those cases 
separately, and deal only with the percentage 
of completed treatments which are successful. 
Approximately one-third of the psychoanalytic 
patients listed broke off treatment, so that the 
percentage of successful treatments of patients 
who finished their course must be put at ap- 
proximately 66 per cent. It would appear, 
then, that when we discount the risk the pa- 
tient runs of stopping treatment altogether, his 
chances of improvement under psychoanalysis 
are approximately equal to his chances of im- 
provement under eclectic treatment, and slight- 
ly worse than his chances under a general 
practitioner or custodial treatment. 

Two further points require clarification: 
(a) Are patients in our “control” groups 
(Landis and Denker) as seriously ill as those 
in our “experimental” groups? (4) Are 
standards of recovery perhaps less stringent in 
our “control” than in our “experimental” 
groups? It is difficult to answer these questions 
definitely, in view of the great divergence of 
opinion between psychiatrists. From a close 
scrutiny of the literature it appears that the 
“control” patients were probably at least as 
seriously ill as the “experimental” patients, and 
possibly more so. As regards standards of re- 
covery, those in Denker’s study are as stringent 
as most of those used by psychoanalysts and 
eclectic psychiatrists, but those used by the 
State Hospitals whose figures Landis quotes 
are very probably more lenient. In the absence 
of agreed standards of severity of illness, or of 
extent of recovery, it is not possible to go fur- 
ther. 

In general, certain conclusions are possible 
from these data. They fail to prove that psy- 
chotherapy, Freudian or otherwise, facilitates 
the recovery of neurotic patients. They show 
that roughly two-thirds of a group of neurotic 
patients will recover or improve to a marked 
extent within about two years of the onset of 
their illness, whether they are treated by means 
of psychotherapy or not. This figure appears 
to be remarkaby stable from one investigation 
to another, regardless of type of patient treated, 
standard of recovery employed, or method of 
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therapy used. From the point of view of the 
neurotic, these figures are encouraging; from 
the point of view of the psychotherapist, they 
can hardly be called very favorable to his 
claims. 

The figures quoted do not necessarily dis- 
prove the possibility of therapeutic effective- 
ness. There are obvious shortcomings in any 
actuarial comparison and these shortcomings 
are particularly serious when there is so little 
agreement among psychiatrists relating even 
to the most fundamental concepts and defini- 
tions. Definite proof would require a special 
investigation, carefully planned and method- 
ologically more adequate than these ad hoc 
comparisons. But even the much more modest 
conclusions that the figures fail to show any 
favorable effects of psychotherapy should give 
pause to those who would wish to give an im- 
portant part in the training of clinical psy- 
chologists to a skill the existence and effective- 
ness of which is still unsupported by any sci- 
entifically acceptable evidence. 


These results and conclusions will no doubt 
contradict the strong feeling of usefulness and 
therapeutic success which many psychiatrists 
and clinical psychologists hold. While it is 
true that subjective feelings of this type have 
no place in science, they are likely to prevent 
an easy acceptance of the general argument 
presented here. This contradiction between 
objective fact and subjective certainty has been 
remarked on in other connections by Kelly and 
Fiske, who found that “One aspect of our find- 
ings is most disconcerting to us: the inverse 
relationship between the confidence of staff 
members at the time of making a prediction and 
the measured validity of that prediction. Why 
is it, for example, that our staff members tend- 
ed to make their best predictions at a time when 
they subjectively felt relatively unacquainted 
with the candidate, when they had constructed 
no systematic picture of his personality struc- 
ture? Or conversely, why is it that with in- 
creasing confidence in clinical judgment . . . we 
find decreasing validities of predictions?” [23, 
p. 406]. 

In the absence of agreement between fact 
and belief, there is urgent need for a decrease 
in the strength of belief, and for an increase 
in the number of facts available. Until such 
facts as may be discovered in a process of rigor- 
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ous analysis support the prevalent belief in 
therapeutic effectiveness of psychological treat- 
ment, it seems premature to insist on the in- 
clusion of training in such treatment in the 
curriculum of the clinical psychologist. 


Summary 


A survey was made of reports on the im- 
provement of neurotic patients after psycho- 
therapy, and the results compared with the best 
available estimates of recovery without benefit 
of such therapy. The figures fail to support 
the hypothesis that psychotherapy facilitates re- 
covery from neurotic disorder. In view of the 
many difficulties attending such actuarial com- 
parisons, no further conclusions could be de- 
rived from the data whose shortcomings high- 
light the necessity of properly planned and 
executed experimental studies into this im- 
portant field. 


Received January 23, 1952. 
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An Experimental Investigation of a 
Psychotherapeutic Technique’ 
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A major issue in psychotherapy is the level 
at which a therapist should express his recog- 
nition of the feelings and attitudes of a patient 
[1, pp. 78, 90; 10, pp. 205, 346]. Rogers and 
his co-workers have supported the position that 
a therapist should reflect only the feelings ex- 
pressed explicitly by a patient. In contrast, psy- 
choanalytically oriented therapists have tend- 
ed to interpret the “deeper” feelings of a pa- 
tient which are inferred or implied by his 
verbal and motor behavior. As yet, there has 
been little objective and experimental verifica- 
tion of these assumptions. The Rogerians have 
presented statistical-observational evidence to 
support their contention that recognition of ex- 
plicit feelings will bring about an improvement 
in the patient [11, pp. 131-196]. Whether or 
not deeper recognition of feeling would have 
resulted in faster and greater improvement is 
not known. Other schools point to their clini- 
cal experience to show that better results are 
achieved by “deeper” recognitions. Only ex- 
perimentally controlled studies can offer any 
definite conclusions. The present research was 
an attempt in that direction. Insight was 
selected as the criterion for patient improve- 
ment because almost all modern theories em- 
phasize its role in therapeutic success [8, p. 


277; 10, p. 40). 


Previous Studies 


Few investigations in the field of psycho- 
therapy have used control groups, fewer have 
attempted to manipulate one factor while hold- 
ing others constant, and still fewer have at- 
tempted to compare various psychotherapeutic 
approaches. Notable exceptions are studies by 

1The author wishes to express his appreciation to 


Dr. S. M. Wesley, Beverly Hills, California, for 
his help in the preparation of this manuscript. 


Closson and Hildreth [2], Haggard and 
Murray [6], and Keet [7]. Only the latter 
study has a direct bearing on the present in- 
vestigation. Keet divided his subjects into two 
groups, induced an experimental conflict, and 
attemped to alleviate the conflict by what he 
terms the “expressive” technique with one 
group and the “interpretive” technique with 
the other. His results were overwhelmingly 
in favor of the “interpretive” group. From a 
survey of the protocols it would seem that 
structuring and interpositions [3, p. 69], as 
well as the expressive and interpretive tech- 
niques, were utilized. This makes it difficult 
to ascertain which factor was responsible for 
the results. Also, it is doubtful that many 
statements classified as “interpretive” were in- 
terpretations in the usual sense. Many seem 
to be interpositions. Such a statement as “Per- 
haps there is something about the word itself. . . 


se 


you may have had some experience or some- 
thing like that. . .” would seem to be appropri- 
ate in both expressive and interpretive therapy. 
Other objections to the study include inade- 
quate reflections of feeling, particularly hostili- 
ty, in the expressive group, the failure to match 
the two groups in terms of personality struc- 
ture and intellectual ability, failure to ascertain 
whether or not the therapist carried out his in- 
tent to be as permissive and accepting in one 
group as in the other, and failure to inform the 
subjects of the real purpose of the interview, 
motivating them in terms of psychotherapy. 
The present study attempted to benefit from 
Keet’s experience. 


The Experiment 


The specific hypotheses under investigation 
here are that subjects achieve greater insight 
from therapist responses which attempt to 
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recognize feelings or attitudes explicitly ex- 
pressed by the subject’s verbal and motor be- 
havior, or that subjects achieve greater insight 
from therapist responses which attempt to 
recognize feelings or attitudes inferred or im- 
plied from the subject’s verbal or motor be- 
havior. The latter is commonly referred to as 
interpretation. Insight is defined as a percep- 
tual phenomenon indicating the correctness of 
the perception of one’s own behavior, motiva- 
tion, feelings, or attitudes. 


The experimental design was as follows: 
Two groups of subjects were matched. In one 
group (designated the surface group), the 
therapist attempted to recognize explicit feel- 
ings, and in the other group (designated the 
deep group), implicit feelings. Insight inven- 
tories were administered prior to, just after, 
and four weeks following the therapy sessions. 
An attempt was made to hold constant, as well 
as at a maximum, such therapeutic variables as 
acceptance, permissiveness, understanding, 
structuring, and patient responsibility. Pa- 
tient responsibility was defined as a behavior 
pattern on the part of the therapist, indicating 
his willingness for the patient to direct and 
determine the course of the interview. 


Variables connected with the subject’s per- 
sonality and intelligence were controlled, in the 
main, by the matching technique. From a 
group of over 600 individuals, ten pairs of 
male college subjects (mostly World War II 
veterans enrolled in elementary psychology) 
were matched on the basis of their interest in 
obtaining psychotherapy and their temperament 
test scores on the Guilford-Zimmerman Temp- 
erament Survey [4]. Since all the subjects 
paired had indicated a “moderate” or “quite 
strong” interest in participating in the study, 
this served to control motivation. The two 
groups were also matched on the basis of verbal 
and reasoning ability (as measured by the 
Thurstone Primary Abilities Tests [12]), 
prognosis (as ascertained from the Thematic 
Apperception Test [9]), and insight (as 
measured by two insight inventories [5]). An 
attempt was made to screen out any subject 
who would be classified as psychotic, psycho- 
pathic, moderately or severely neurotic, or who 
had previously received psychotherapy. This 
was done through the use of the Guilford- 
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Zimmerman Temperament Survey, and the 
ten “stop items” from the Cornell Selectee 
Index [13], plus an additional item to ascer- 
tain if the individual had received psychological 
or psychiatric treatment. These last two men- 
tioned tests were administered in group form 
in the classroom, while the TAT was ad- 
ministered individually to each of the twenty 
subjects. 

No statistically significant differences be- 
tween the two groups were found on any of 
the above-mentioned measures. It must be 
stated, however, that although the objective 
evidence indicated otherwise, there were data 
which cast some doubt on the adequacy with 
which the two groups were matched. An in- 
spection of the wire-recorded interviews re- 
vealed that for some unaccountable reason the 
deep group contained a majority of the less in- 
hibited and less resistant subjects. Every sub- 
ject in the deep group entered therapy with a 
definite problem and, with little hesitancy, pro- 
ceeded to unfold and analyze it. This was not 
true of seven of the subjects in the surface 
group. Four subjects in this group claimed 
that they were more interested in the test re- 
sults than in counseling. Three additional 
subjects insisted they had no definite problem 
to explore. It would seem, then, that the de- 
sired matching of the two groups was not 
wholly accomplished. 

In order to obtain measures of insight, two 
inventories were administered individually to 
each subject prior to the therapy sessions. 
These inventories are described in detail else- 
where [5]. In brief, however, one inventory 
was based on the Guilford-Zimmerman Tem- 
perament Survey. It consisted of the twelve 
definitions of the traits measured, each followed 
by a graphic percentile scale. The subjects 
were asked to rate themselves in comparison 
with the male college population. The insight 
score consisted of the sum of the discrepancies 
between the subject’s own ratings and his 
actual scores on the temperament survey. It 
was assumed that the larger the discrepancy, 
the less insight the subject possessed into his 
behavior. The following is a sample definition 
of a trait to be rated: 


A tendency toward the following: a high degree 
of energy, overt action, and general activity; lik- 
ing for speed; and being “on the go.” A tendency 
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away from the following: inertness and disinclina- 
tion ‘for motor activity. 


The second insight inventory was based on 
interpretations derived from the TAT. It con- 
sisted of 20 to 30 four-choice items for each 
subject, covering such areas as attitudes toward 
parent figures, sex, aspiration level, and thera- 
py. The insight score represented the amount 
of agreement between the subject’s answers 
and the answers chosen for him by two psy- 
chologists, based on their interpretations from 
the subject’s TAT. The following is a sample 
item: 


In general, for things to turn out well, my moth- 
er’s affection and approval are: (a) quite impor- 
tant, (b) of moderate importance, (c) not too im- 
portant, (d) unimportant. 


Three sixty-minute interviews were held 
with each subject with the third and final in- 
terview being wire recorded and transcribed. 
This was done in order to have some check, 
other than the therapist’s own judgment, that 
his responses were varied in accordance with 
the experimental design. The therapist state- 
ments during the third interview with all sub- 
jects then were classified by two judges, the 
therapist and another psychologist,? according 
to the following three categories: (a) recog- 
nition of explicit feeling statements, (4) recog- 
nition of implicit feeling statements, and (c) 
miscellaneous statements (opening or closing 
therapist statements, unavoidable interruptions, 
asking the subject to repeat a phrase, etc.). 
As far as possible, the correctness of the 
therapist statement was not considered. 


The following are examples classified by 


both judges as being explicit feeling state- 
ments: 


Example 1 


Subject: However, I do feel that the most im- 
portant reason for my tenseness in school lies in the 
fact that I’m married and feel the responsibility for 
getting out and supporting my family. (slight 
pause) I can go around in circles and circles, and 
it all seems to come back to that one thing—that my 
wife works and I don’t work. 

Therapist: Of all the different points that we 
talked about, this one bothers you the most. 


2The author wishes to express his appreciation 
to Dr. Maurice Rapkin, Los Angeles Psychological 
Service Center, for his valuable assistance. 
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Example 2 


Subject: Yeh. I feel it is a sacrifice for her... 
I can make those things up to her if I can ever 
get through school. .. . But as far as the monetary 
factor is concerned, I don’t feel as strongly about 
that, as I do about giving her this responsibility 
and my not taking it. 

Therapist: Mmm. It would seem, then, that it 
bothers you a great deal that you’ve given some- 
one else something which you feel is your respon- 
sibility. 


The following are examples classified by 
both judges as being implicit feeling state- 
ments : 


Example 1 


Subject: Being the almost perfectionist that my 
father is, I don’t think that he thinks as highly of 
my kid brother as he used to, because, as I say, he 
Dad likes it done. 
He’s more of a spendthrift. I know that Dad, in 
the brief times that I’ve seen him, uh, usually once 
or twice a year, is not at all critical of me, and I 
like that very much. 

Therapist: Do you 
perfectionistic, 


almost does nothing the way 


think 
liking for organization, 
and so forth, might have been your instrument to 
gain the affection that you always wanted from 
your father—whom you felt had more affection for 
your brother than you? 


your becoming more 


planning, 


Example 2 


Subject: I can’t see how I can 
grades though. (long pause) I don’t know 
more I could tell you. I seem to be at a loss for 
words. 

Therapist: It might be that this “loss for words” 
is a blocking of more important aspects of your 
problem which you're avoiding talking about. 


improve my 


what 


A third measure of insight was utilized. Im- 
mediately following the last interview with 
each subject, the therapist rated on a five-point 
scale the amount of insight gained by the sub- 
ject. The therapist also rated each subject on 
his “therapeutic progress.” 

Within a day or two following the last in- 
terview with each subject, the insight inven- 
tories were readministered in order to ascertain 
the degree of insight possessed by the subjects 
after the experimental therapy sessions. The 
inventories were administered again after ap- 
proximately four weeks in order to observe the 
permanence of this effect. At this latter time 
eight four-point rating scales (described in de- 
tail in the next section) were also administered 








328 


in order to ascertain: (a) whether the thera- 
peutic variables which the therapist wished to 
hold constant and at a maximum were per- 
ceived as such by the subjects, (6) whether the 
therapeutic variable which the therapist wished 
to manipulate was perceived by the subjects 
as intended, and (c) the extent to which the 
subjects felt the therapy sessions were bene- 
ficial. ‘These ratings were made anonymously 
in order to allow as much freedom as possible 
for the expression of negative attitudes. 


Statistical Results 


Results on the classification of the therapist 
statements. Close agreement obtained between 
the judge and therapist on their classification 
of the therapist statements. These classifica- 
tions were made independently. The mean 
percentage of agreement between the judge 
and therapist on the classification of statements 
made in the surface group was 86.4, and in the 
deep group, 86.5. The range in the surface 
group was 75 to 96 per cent, and in the deep 
group 80 to 90 per cent. 

Table 1 shows the results of the classifica- 
tion of the therapist statements. In general 
it indicates that the therapist was highly suc- 
cessful in varying his statements, recognizing 
mainly explicit feelings in the surface group 
and implicit feelings in the deep group. 


Table 1 


Mean Percentages of Therapist Statements in the 
Surface and Deep Group as Classified by 
the Therapist and Judge 











Surface (N10) Deep (N=—10) 

tion Thera- Thera- 
category pist Judge pist Judge 
I—Statements to 

recognize 

explicit 

feelings 84.3 83.1 12.2 15.8 

Range 70-92 70-96 3-24 7-24 
II—Statements to 

recognize 

implicit 

feelings 8.¢ 10.1 75.5 74.5 

Range 4-20 0-23 64-84 66-83 
Til—Miscellan- 

ous 

statements 6.6 6.8 11.6 9.7 

Range 4-10 3-10 7-17 6-15 
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Results on the subjects’ perception of the 
therapist's attempt to hold constant or vary 
therapeutic variables. One of the purposes of 
the rating scales was to ascertain whether the 
therapeutic variables which the therapist 
wished to hold constant or vary were perceived 
by the subjects as intended. The results are 
presented in Table 2. The ¢ ratio was used 
throughout. 

Scale 1 was as follows: 


To what extent did you feel the therapist really 
understood you as an individual, your problem(s), 


worries, and tensions? 


No differences existed between the groups 
and both felt that they were “well” to “almost 
completely” understood. 

Scale 2 was designed to ascertain whether 
the subjects perceived the therapist levels of 
recognition statements as intended. It read: 

To what extent do you feel that the therapist at- 
tempted to point out some of the causes for your 


problem(s) or worries? 


Reference to Table 2 will indicate that the 
surface group felt that the therapist only 
“sometimes” pointed out the causes, while the 
deep group felt he “frequently” did. The sta- 
tistical difference between the two groups is 
significant at the 2 per cent level of confidence. 
It would seem then, that from the subjects’ 
point of view the therapist made significantly 
fewer interpretations in the surface group than 
in the deep group. 


Scale 3 was designed to measure whether the 
subjects felt that the therapist was dominating 
or authoritarian. It read: 


How much pressure did you feel to agree with 
the statements made by the therapist? 


There was no difference between the two 
groups, and both felt from “none” to a “slight” 
amount of pressure. 


Scale 5 was intended to measure how ac- 
cepted the subjects felt. It read: 
How did you feel the therapist felt toward your 


ideas and actions outside of the counseling sessions, 
that is, your life in general? 


Here again, there was no difference, and 
both felt the therapist was “generally accepting 
— did not seem to be judging my ideas either 
way.” 

Scale 6 was designed to measure permissive- 
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ness. It read: 


How free did you feel in the counseling sessions 
to divulge information which you considered of a 
very personal nature? 


The group as a whole felt “moderately” to 
“very” free to divulge very personal material, 
and there was no difference between the two 
groups. 

The results from Scale 4 proved to be in- 
teresting. Table 2 reveals that there was a sig- 
nificant’ difference between the two groups 
(close to the 1 per cent level). It read: 

How much responsibility did you feel the thera- 


pist placed on you to work out or come to an un- 
derstanding of your problem(s) or worries? 


The surface group felt from a “moderate” 
to “a great deal” of responsibility, while the 
deep group felt from a “slight” to a “moder- 
ate” amount. This therapeutic variable then, 
was not held constant. The group as a whole 
felt more than a “moderate” amount of re- 
sponsibility to work out their own problems. 


Table 2 


t Ratios of Differences Between Mean Ratings on 
Post-Interview Rating Scales by Subjects in 
the Surface and Deep Groups 














° 

3 # 
rf ‘~ ¢ & 
3s . ae , 
1 270 4 270 3 270 72 .00 
2 1.90 .83 2.90 .83 240 .97. 2.74 
3 1.80 .98 1.70 .90 1.75 .94 -0.23 
4 so 2 as. $.10 33 -296 
5 2.20 60 2.50 .87 2.40 .73 86 
6 239 647 236 38 260 58 
7 2.50 1.00 2.70 .90 2.60 1.02 45 
8 2.80 .75 3.50 67 3.15 .79 2.09 





*For an N of 20 with 18 degrees of freedom, a t ratio 
of 2.10 is required for significance at the 5 per cent 
level ; 2.55 at the 2 per cent level; and, 2.88 at the 1 per 
cent level. 


Results on the differences in insight between 
the two groups. Tables 2, 3, and 4 show the 
results from the various devices used to ascer- 
tain the differences produced between the two 
groups. The ¢ ratio was used throughout. 
From Table 3 it can be seen that the insight 
inventories did not show any significant dif- 
ferences between the two groups, although the 
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direction was consistently in favor of the deep 
group. The ratings of the therapist did show 
significant differences in favor of the deep 
group. (It should be stated that the therapist’s 
therapeutic bias was in favor of the surface 
group.) It can be seen, also, from Scale 8, 
Table 2, that the subjects in the deep group 
rated the counseling sessions more beneficial 
than did those in the surface group. It read: 
In general, taking everything into account, how 


would you evaluate the counseling sessions? 


The subjects in the deep group felt that the 
sessions were from “moderately” to “quite” 
beneficial, while the subjects in the surface 
group felt the sessions were from “slightly” 
to “moderately” beneficial, but much closer to 
“moderately.” 


Table 3 
t Ratios of Differences Between Mean Scores on 
Insight Inventories I and II, and Therapist 
Ratings on Insight and Progress 
Inventory . ¢ 
Insight Inventory I 
Post-interview test 70.1 689 73.8 7.12 1.12 
Follow-up test 68.9 941 74.2 7.97 1.29 
Insight Inventory II 
Post-interview test 48.6 9.79 56.8 13.31 1.49 
Follow-up test 46.5 13.29 54.3 12.55 1.28 
Therapist Ratings 
Insight 2.1 1.24 3.2 87 2.27 
Progress 1.8 87 28 1.12 2.1 





*For an N of 20 with 18 degrees of freedom, a t ratio 
of 1.73 is required for significance at the 10 per cent 
2.10 at the 5 per cent level; and 2.55 at the 2 per 
cent level. 


level ; 


Scale 7 was designed to shed some light on 
a rather important hypothesis, which needs 
much more elaborate and thorough investiga- 
tion. The hypothesis, offered mainly by those 
of the Rogerian school, is that deep interpreta- 
tions arouse sufficient anxiety to impede learn- 
ing. Interpretative therapists, on the other 
hand, deny this. The scale read: 


How much tension and anxiety do you feel you 
were under during the counseling sessions? 
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If the subject’s judgment can be accepted as 
an adequate criterion, the results point to the 
tentative conclusion that deep interpretations 
do not produce more anxiety than surface ones, 
at least at the conscious level. A side study, 
however, adds further information. This study 
was initiated to investigate the hypothesis that 
deep interpretations arouse more resistance to 
therapy than surface ones. Resistance was de- 
fined operationally in terms of absences or 
tardiness (after 10 minutes) for an interview. 
The difference between the two groups almost 
approached the 10 per cent level of confidence. 
While this difference does not reach acceptable 
limits, it does lend a little support to the 
Rogerian contention that more resistance to 
therapy is raised by deep interpretations. The 
methodology involved here would seem to be 
the more important aspect of the study. 

Results dealing with changes in insight. In 
general, the results presented in Table 4 in- 
dicate that the surface group did not show any 
significant gains or losses between their pre- 
interview, post-interview, and follow-up test 
scores on either insight inventory. The deep 
group, however, made significant gains between 
their pre-interview test scores and post-inter- 
view test scores on Insight Inventory II, and 


Table 4 

t Ratios of Differences Between Mean Scores on 
Insight Inventories I and II, in the Pre-, 

Post-, and Follow-up Testing 





Means 





Score Follow- t* 
Pre Post up 
Insight Inventory I 
Surface group 70.5 70.1 _-- -.02 
70.5 —_ 68.9 -.69 
—_ 7 68.9 -.61 
Deep group 70.8 73.8 —_ 1.68 
70.8 -- 74.2 2.38 
_— 73.8 74.2 21 
Insight-Inventory II 
Surface group 48.2 48.6 os 17 
48.2 os 46.5 -.48 
_ 48.6 46.5 -.93 
Deep group 49.8 56.8 oo 2.53 
49.8 _ 54.3 1.28 
— 56.8 54.3 -1.41 





*For an N of 10 with 9 degrees of freedom, a t ratio 
of 1.83 is required for significance at the 10 per cent 
level ; 2.26 at the 5 per cent level; and 2.82 at the 2 per 
cent level. 
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between their pre-interview and follow-up test 
scores on Insight Inventory I. 


Discussion 


Despite the fact that the two groups were 
not matched on all factors, certain conclusions 
sectn Wal ranted and worthy of discussion. Of 
major importance are the gains made in insight 
by the deep group and the significant difference 
between the two groups both on insight and 
therapeutic progress, according to the thera- 
pist’s judgment. The question arises as to 
whether or not the insights gained were of an 
Evidence that 
emotional insight took place in the deep group 
is indicated in part by a survey of the trans 
cribed interviews. Many subjects not only ac- 
cepted the interpretations but offered addition- 
al confirmatory material. Others spontancous- 
ly reported changes in the manner in which 
they perceived the environment. Other evi- 
dence that some emotional insight took place 
could be inferred from the fact that the sub- 
jects in the deep group felt that the sessions 
were “quite” beneficial. It would seem then, 
that the subjects in the deep group did gain 
significantly in emotional insight, while those 
in the surface group did not. The evidence 
taken as a whole tends to reject the first hy- 
pothesis offered (namely, that greater insight 
is achieved by therapist recognition of explicit 
rather than implicit feelings) and to confirm 
the second (namely, that greater insight is 
achieved by therapist recognition of implicit 
rather than explicit feelings). 

The fact that the therapist tried, but was un- 
able, to hold constant “patient responsibility,” 
seems due to the inherent nature of recogniz- 
ing implicit feelings. In a sense, this practice 
results in directing the subjects’ attention to 
factors of which by definition they are un- 
aware. In turn, they are relieved of some of 
the responsibility. The study, also, revealed 
that having to take the full responsibility for 
directing the interviews came as a distinct 
shock to many patients, especially overly de- 
pendent ones. This may have resulted in their 
inability to gain insight. Recognizing implicit 
feelings seemed to relieve patients of undue 
responsbility, as well as to facilitate insight. 

This research suffers from a number of 
limitations: the inability to achieve perfectly 


intellectual or emotional nature. 
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matched groups, the inability to hold constant 
all dependent therapeutic variables, the small 
number of interviews, and the difficulty in dif- 
ferentiating intellectual from emotional in- 
sight. On the other hand, objective criteria 
were established and success was achieved in 
manipulating the independent variable, as well 
as holding constant all but one of the dependent 
ones. This lends strong support to the conten- 
tion that psychotherapy can be studied objec- 
tively and experimentally. 


Summary 


The hypothesis under investigation here 
was that greater insight is achieved by ther- 
apist recognition of explicit rather than im- 
plicit feelings of the subjects, or that greater 
insight is achieved by therapist recognition of 
implicit rather than explicit feelings of the 
subjects. Two groups of ten male subjects were 
matched on the basis of personality, insight, 
verbal and reasoning ability, prognosis, and 
motivation. In one group the therapist recog- 
nized only explicit feelings and in the other 
only implicit feelings. Insight measures were 
administered prior to, just after, and four 
weeks following three sixty-minute individual 
interviews, which were wire recorded. The 
results indicate that the matching was success- 
ful except for a preponderance of resistant sub- 
jects in the surface group; the therapist was 
able to manipulate the experimental variable as 
intended ; the two insight inventories revealed 
no differences between the two groups while 
the therapist ratings did, in favor of the deep 
group; and the deep group made significant 
gains in emotional insight while the surface 
group did not. In addition, the study supports 
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the contention that psychotherapy can be 
studied objectively and experimentally. 
Received January 23, 1952. 
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Client-Centered Therapy and the 
Involuntary Client 


Ernst G. Beier 


Syracuse University 


It is generally recognized that the most 
favorable condition for successful psychothera- 
py exists when the client himself feels ready to 
seek help for his problems. Rogers [8] lists 
the client’s wish for help as one of “the most 
significant steps in therapy.” Fenichel [3] 
states ‘the method of psychoanalysis is based 
on the cooperation of a _ reasonable ego.” 
Freud’s “basic rule” certainly can only be fol- 
lowed by an analysand who desires to be helped 
by analysis. 

However, there is increasing evidence that 
therapists of all orientations have been called 
upon to make their services available to clients 
who do not seek their assistance. We need 
only to think of the efforts of some workers to 
give psychotherapeutic treatment to juvenile 
delinquents [4], to prisoners [7], to court re- 
ferrals [6], to employees [2], to students re- 
ferred by deans’ offices, and, last but not least, 
to children who, after all, rarely come to the 
clinic of their own free will. 

It is the intent of this paper to discuss meth- 
ods available to the therapist to assist the “in- 
voluntary” client—the client who does not 
make the decision for therapeutic help him- 
self—to make appropriate use of the thera- 
peutic hour. It should be understood from the 
outset that assisting an involuntary client 
toward ‘“‘therapy-readiness’ can never mean 
that the therapist is attempting to make a cli- 
ent out of a nonclient. Rather, the therapist 
can hope to work with an “unready client” 
only if he deals with a client in the first place, 
i.e., an individual who engages in maladaptive 
behavior and who has some wish to free him- 
self of such behavior. An involuntary client, 
then, would be an individual in whom resist- 
ance toward giving up symptoms and sub- 
stitute gratifications is greater than his desire 


for help. With an involuntary client, the 
therapist must still attempt to support the cli- 
ent’s own motivation for treatment. 

The phase of the psychological contact which 
deals with supporting the client’s own readi- 
ness for therapeutic help shall be called here 
the “pretherapeutic phase.” In clinical experi- 
ence, various types of pretherapeutic phases 
seem to be recognized. As yet, no information 
is available to indicate that one approach is 
more successful than another. It is likely that 
none of the approaches practiced can be “suc- 
cessful’’ per se, but that each approach can 
achieve its maximum gain in dealing with 
specific syndrome-constellations. Accordingly, 
in the following presentation, we are not aim- 
ing at finding a panacea. All we wish is to 
present a few incidents with involuntary cli- 
ents, follow them through to success or failure 
(does the client become ready to seek help on 
his own?), and discuss their implications, per- 
haps with a hint here and there concerning 
how this problem can be investigated more 
properly [5]. We will, however, present the 
incidents in some order, being guided in this 
respect by our recognition of some types of 
“pretherapeutic’’ approaches which can easily 
be distinguished. 

In our experience, counselors who are con- 
fronted with the involuntary client generally 
deal with him in one of the following ways: 


A. They do not accept him, but “wait him 
out,” until he is ready to seek help on his own.* 


B. ‘They accept him and actively engage in 
resistance reflections. 


1Category' A actually deals with “involuntary 
clients” who can absent themselves and those who 
cannot absent themselves from the therapeutic hour. 
The latter, who cannot absent themselves from the 
hour (court referral), present an extra problem. 
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C. They accept him and discuss with him 
in an “above-board” fashion the fact that he is 
considered a client and the ways he can go 
about helping himself (anxiety-arousal). 


Below we will present incidents and discus- 
sions relative to each of these major approaches 
for dealing with the involuntary client. 


A. The counselor does not accept the in- 
voluntary client but decides to “wait him out” 
until he is ready to seck help on his own 


Incidents: 


1. Billy, aged 7, was referred to the clinic be- 
cause of aggressive behavior. His parents brought 
Billy to the clinic against his will and he refused 
to follow the play therapist to the playroom. The 
worker accepted Billy’s refusal and permitted him 
to stay downstairs in the waiting room. The par- 
ents, themselves engaged in a counseling hour, were 
given to understand that it was felt desirable for 
Billy to make his own decision and they accepted 
this. For the second session, Billy stayed for only 
10 minutes and left again. In the third session, 
Billy stated that he really liked to come, and stayed 
for the whole hour. At this time the parents re- 
ported that even after so few contacts Billy’s be- 
havior had changed markedly. To what extent this 
change was due to parent counseling was not clear 
but the worker felt that Billy’s early freedom of 
choice had much to do with it. 


2. Johnny, a withdrawn child, aged 7, did not 
want to enter the playroom. He was given free 
choice to leave. The mother, in a counseling ses- 
sion, was informed as to the purpose of the free 
choice for Johnny. After three meetings, during 
which Johnny never entered the playroom, the 
mother decided to leave. She stated that Johnny 
was too immature to make such a decision. The 
workers felt that the mother would have given the 
clinic more time if Johnny had been taken to the 
playroom. She felt that by offering Johnny a free 
choice, he was presented with an unnecessary, and 
at this time insurmountable, problem. 

3. Brown, a student, referred by the dean’s office 
because he “literally fell asleep in class,” protested 
during the first hour that he had no problems and 
did not wish to come to the clinic. The therapist 
structured to him that he did not have to come, 
even though the dean’s office had sent him. Brown, 
first making certain that he had understood cor- 
rectly, almost instantaneously began to make use 
of the hour. 

4. Smith, a young woman, was referred to the 
clinic by the dean’s office for “emotional rehabilita- 
tion.” Smith felt that she did not want to come to 
a psychological clinic. She was given free choice in 
the matter of entering into a counseling relation- 
ship and decided to leave. After two months the 


report came to the clinic that Smith had had a ner 
vous breakdown and had left school. 

5. An athletic team was requested by the ad 
ministration to report to the psychologist for the 
purpose of helping with the team’s esprit de corps. 
The members of the team, deeply resentful of their 
coaches, transferred this resentment toward the psy- 
chologist and made an agreement that none of them 
would talk. The psychologist structured to them 
at their first session that they did have to be pres- 
ent but did not have to talk. As was hoped, a long 
discussion ensued which was directed toward their 
relationship with their coaches. The very permis 
sion “not to talk” had effected in the team a feel- 
ing that the psychologist was on their side 

6. A student had to enter a counseling relation- 
ship by court order. He sat through twelve silent 
hours with the therapist. The therapist had struc 
tured to him that he did not have to talk and the 
client accepted this at face value. Reflections of si- 
lence were recognized by the client but not utilized. 
From the diagnostic record, severe emotional dis- 
turbances were indicated, and the worker felt by 
the end of the time that no gains had been made, 
that the silence had been a sign of defiance. Fol 
low-up information supported this diagnosis and 
the worker’s opinion. 


Discussion. In all these incidents, the ther- 
apist’s intent was to assist the involuntary cli- 
ent in his motivation toward therapy and 
growth. Following the thinking of client- 
centered theory, the therapist communicates. to 
the client from the start that he respects the in- 
tegrity of his client, that he does not identify 
with the referral agency, and that he has faith 
and confidence in the client’s ability to be a 
person and to make his own decisions. The 
quickly achieved “therapy-readiness” in Cases 
1, 3, and 5 supports the therapist’s decision to 
offer free choice to the involuntary client from 
the start. The clients clearly benefited from 
this acceptance and seemed to utilize it well. 


An analysis of Cases 2, 4, and 6, however, 
indicates that the same approach—with dif- 
ferent clients—may fail. Here, too, the ther- 
apist tried to communicate his acceptance, but 
the clients, particularly 2 and 4, were not 
ready to sense permissiveness or acceptance. 
Throughout their life experience they had pos- 
sibly never been asked to make a major de- 
cision by themselves, and now they were sud- 
denly presented with this opportunity. They 
were not ready, not free enough emotionally, 
to make this decision objectively. Perhaps 
these two clients may even have been burdened 
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by this choice. ‘They were confronted with an 
additional difficult problem. The therapist’s 
communication, “you alone can decide whether 
or not you need me,” might have been easily 
misunderstood by them as another rejection. 
They had never had the opportunity to get to 
know the therapist and might have misinter- 
preted his statement as one of indifference. 
Case 6, the “forced” involuntary client, il- 
lustrates very neatly how the client persisted 
in thinking of the therapist in defiance. The 
therapist had accepted the court referral be- 
cause of pressure (“‘you take him, or off he 
goes’), but he had given the boy free choice 
to use the hour as he wished. Apparently, the 
boy was so preoccupied with the idea that he 
was forced into this situation that he never 
could sense the therapist’s acceptance. There 
were indications that he perceived permissive- 
ness as hypocrisy. 

Misunderstanding, misinterpretation, and 
the possible lack of readiness to admit and 
sense permissiveness are hindrances in making 
this pretherapeutic phase effective with some 
involuntary clients. It seems to be effective 
with clients of a certain maturity, but fails to 
work where emotional factors are in the way. 
The obvious questions that one would have to 
investigate more closely would be: What are 
these emotional factors that prevent some 
clients from being effectively reached? What 
are the behavioral correlates? Which cues can 
the therapist take into account to determine 
whether or not he should give free choice to a 
client without taking the risk of losing him al- 
together (Case 4)? Are other, more adequate 
methods available? 

It should be carefully noted that we are 
speaking of approaches to therapy and not of 
the basic attitudes of the counselor. The re- 
spect for the integrity of the client and the 
acceptance of the client are not intended to be 
placed in doubt as basic requirements. When 
we speak of various approaches to helping mo- 
tivate the involuntary client, we mean ways 
of responding to very real needs of the client. 


B. The counselor accepts the involuntary 
client and engages in resistance reflections 


Incidents: 


1. The mother of a young child, who, at an 
earlier contact, had refused to enter into a counsel- 
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ing relationship, was asked to “give it a chance.” 
The staff had decided, in this case, not to take 
the child (aged 3) unless the mother would also 
accept counseling for herself. The mother reluc- 
tantly agreed. She stated in her first session that 
she really had no problem and that she did not 
know what she could discuss, The therapist re- 
flected her anger about being pushed into the coun- 
seling also commented during the 
hour on other manifestations of resistance, her em- 


situation. He 


barrassment (perpetual laughing), her ambivalence 
(twice leaving the room), her difficulty in starting 
(silences, laughing, stating that she was talking in 
circles), and finally her very real wish to obtain 
help herself (although not based on expressed com- 
munication). What seems here to have been a mer- 
ciless onslaught on the part of the therapist was 
understood by the client as very deep concern for 
her, as supported by the client’s own “testimonial” 
after the pretherapeutic phase had long since passed 
into a therapeutic one: “For the first time in my 
life (during the first hour), I felt that somebody 
was concerned with my troubles.” The therapist felt 
that real effort to 
understanding to the client, the client would have 


without his very communicate 
maintained her feeling that nobody could be inter- 
ested in her feelings and would have stayed out of 
the relationship. It should be noted that consider- 
able dependency had been established between the 
therapist and the client which had to be worked 
through in later contacts. 

2. Another involuntary client, a young woman, 
came to the clinic and told the therapist that her 
husband had sent her because she was “upset too 
easily.” She expressed some resentment toward her 
husband’s behavior, particularly that he had not 
come with her. She stated that she did not know 
what to talk about and thought that her husband 
certainly would not want her “to give away se- 
crets.” She was seen for six one-hour sessions, in 
which she either kept silent or discussed her bus 
trip to the clinic. In staff discussion, she was seen 
as an involuntary client who was badly in need 
of help and who was unable to break through her 
feelings of resistance. In the seventh hour, the ther- 
apist, in order to help her to feel more deeply under- 
stood and to establish a more therapeutic relation- 
ship, reflected to her a feeling that was implicit in 
many of her previous statements. She had never 
directly expressed the feeling, of which she was 
either unaware or could not yet bring herseif to 
communicate, namely, hostility toward her husband. 
She felt her husband seemed to hold her respon- 
sible for all difficulties in the family. A statement 
to that effect, however, although cautiously word- 
ed, did not at all assist her towards a deeper rela- 
tionship, but increased her resistance to the point 
where she had good reason to break the contact. 

3. Another involuntary client, a delinquent boy 
of 15, had been accepted for therapy by court or- 
der. He had decided that he did not need us as he 
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was not “nuts” and for two hours the boy main- 
tained silence, a remarkable feat for a 15-year-old. 
We were convinced that the boy had severe emo- 
tional problems (he had been caught prowling in 
a number of buildings, stealing women’s dresses), 
and decided in staff conference to consider reflec- 
tion of resistance. The therapist reflected the boy’s 
unexpressed feeling of bewilderment in this situa- 
tion, his unexpressed fear that this might not be 
confidential, his fear of being called “nuts,” his 
resentment and helplessness (biting of fingernails), 
and the stress under which the silence placed him. 
Rapport was established when the therapist used 
four-letter words to reflect the boy’s feeling of an- 
guish. He began to feel more at ease and work pro- 
ceeded on more therapeutic lines. The therapist had 
established himself as a person who did not iden- 
tify himself with authority, and in effect the boy 
could feel that he was being understood. 

4. A girl was sent over by the dean’s office be- 
cause she was unable to do her schoolwork effici- 
ently. She claimed that she was blind in one eye, 
although medical examination revealed no defect. 
The student stated that she had no problems and 
accepted her blindness without any signs of anx- 
iety. She stayed with the therapist for a number of 
hours which seemed most unproductive. She wouid 
say that she knew that she had to come to the clinic 
but that there was nothing wrong with her. After 
twelve contacts, a more active pretherapeutic phase 
was recommended by the staff. During the next few 
hours, the therapist reflected her unexpressed am- 
bivalence about coming to the clinic (there was no 
pressure from the dean’s office), her unexpressed 
worry about the effects of her symptom (fear of 
getting behind in her classes), her feeling about the 
therapist (fear of dependency), and her anger 
about the university (the dean had discussed with 
her suspicions of sexual promiscuity). The ther- 
apist felt that the effect of his closer participation 
and his communication of concern helped the girl 
to become more thoroughly motivated toward the 
counseling relationship. The student brought more 
intimate material for discussion and the contact 
lasted for some 35 hours. The girl left school and 
recent reports state that she is free from her symp- 
toms. 

5. Another involuntary client, a student who 
had been referred to the clinic by the student court, 
had four contacts during which he was clearly 
evasive. He would pointedly talk about the weath- 
er and related subjects. From the record, it was evi- 
dent that the student was under great tension; he 
was also known to engage in homosexual behavior. 
After the fourth hour, the therapist began to par- 
ticipate more readily, reflecting on the meaning of 
the evasive comments, reflecting unexpressed feel- 
ings such as resentment over being sent in and the 
fear that he might reveal unpleasant material dur- 
ing the hour. No change in behavior on the part of 
the client was noted, and he came for the remain- 
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ing eight hours with the very same attitude we 
had witnessed in the beginning. (The student court 
was requested not to use the clinic in this manner 


in the future.) 


Discussion. We assume that the therapist 
client to be- 
“involuntary” 


‘ 


can only help an “involuntary” 
come a motivated client if the 
client is an individual with crippling problems, 
and who is either unaware of them, or does not 
wish to communicate a need for assistance. 
The approach utilized in the above incidents is 
one in which the therapist tries to assist the 
client toward accepting therapeutic help by re- 
flecting unexpressed and unrecognized needs 
(resistance reflections) in order to communi- 
cate to the client his deep concern for him [1 }. 
The therapist’s attempt to understand the 
client better than he understands himself is 
meant to be supportive in nature, a support 
needed to work through feelings of resistance. 

While advantages 


approach A are closely related to the maturity 


and disadvantages of 
of the client (will he sense the acceptant at- 
the present 
directly involving the activity of the counselor. 


mosphere ?), approach is more 
The maturity of his judgment would be an 
important variable. The bias of the free-choice 
approach (A) is: “I behave toward the 
untary client as if he can help himself (and I 
take the risk that he cannot).” This stands in 
contrast to the bias of the second approach 
(B): “I behave toward the involuntary client 
as if he needs my support (Cases 1, 3, and 4) ; 
and I take the risk that my specific support is 
unacceptable to him (Cases 2 and 5).” The 
therapist not only has decided to give support, 
but has to decide throughout the contact where 
to give support. Such an attitude on the part of 
the therapist seems to contradict orthodox 
client-centered counseling, and yet the contra- 
diction is only superficial. In the cases cited, 
the therapist’s basic attitude has been to base 
the counseling on the second tenet of client- 
centered counseling: to present a nonthreaten- 
ing situation. Support was given not to estab- 
lish the authority of the counselor, but to help 
the client to perceive the deep concern of the 
counselor for him and his unexpressed needs. 
It seems clear to us that with support, some 
clients will be reached who otherwise would 
leave the situation. The obvious questions 
which arise with such an approach would be 


invol 
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the following: Who are the clients who would 
profit by such support? What sort of behavior 
will we have to differentiate to make a decision 
with confidence? At which state of his own 
development as a therapist is the worker 
ready to make “adequate” judgments with 
respect to the client? Answers to such 
questions will help us to evaluate the effective- 
ness of various types of pretherapeutic ap- 
proaches. 


C. The counselor discusses in an “above- 
board” fashion the fact that the involuntary 
client is really a client (anxiety-arousal) 


Incidents: 


1. A group of parents had been asked to par- 
ticipate in counseling along with play 
therapy For seven hours these 
parents discussed their children’s problems almost 
exclusively. The therapist felt that the parents were 
operating according to a preconceived set and that 
there was some need for reorientation. At the eighth 
hour, he started the session by considering with the 


sessions 
for their children. 


parents the need for discussing one’s own problems 
in order to understand other people’s problems a 
little better. He put it squarely up to them to see 
themselves as clients. 


The parents reacted in an expected way with 
some confusion and some silence. This phase of 
the contact was understood by the therapist as a 
time of reorientation in which the older defenses 
(talking about children only) no longer worked. 
The therapist reflected the silence and the aroused 
anxieties of the parents. Our records indicate that 
during this hour, as well as the following hours, the 
parents discussed their own relationships and prob- 
lems to a much larger extent and the counselor felt 
that he had a closer relationship with the parents. 


2. An involuntary client, a school teacher who 
had been sent to the clinic by her supervisor, dis- 
cussed some personal problems of her supervisor 
and the effects of these problems on the faculty. 
Afterwards she inquired of the worker what she 
was to do in this hour, and the worker structured 
the hour to her as an opportunity during which she 
would be able to talk about anything she wished. 
He also stated something to the effect that some- 
times people can be helped to become happier when 
they have an opportunity to talk about themselves 
and their problems. This statement was apparently 
very threatening to the teacher. She said to the 
counselor: “I only talk to God about my problems,” 
and left the contact. 

3. A student was referred to the clinic (and to 
the infirmary) by the dean’s office. The student suf- 
fered from paranoid delusions and insisted that 
other students stuck pins into his legs. He ration- 
alized his being sent to see a psychologist with a 
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very neat delusionary system. He demanded that the 
psychologist attend to his mother who, in his own 
words, “the queerest you 
seen.” He described certain behaviors of his moth- 
er which were clearly delusionary (sleeping with 
his uncle the very day that he, the student, had 
poisoned his father). The student stated that he 
did not wish to come back until the psychologist 
had evaluated his mother’s state of mind. The ther- 
apist, in order to motivate the student toward work- 
ing through his own problems, discussed with him 


was woman have ever 


for some four continuous hours various aspects of 
the situation as the student saw them. During this 
phase, the therapist accepted the student’s delusions 
but also participated with such observations as the 
fact that the student would be in a much more 
favorable position if he himself would be less tense. 
Although anxieties 
therapist, it appeared that these anxieties, when 
recognized, deepened the rapport. The student saw 
the therapist daily during the following week at 
the end of which he was referred to a therapist 
near his home (as he had to leave school), now 
more motivated 


certain were aroused by the 


highly toward seeking help on 


his own, 


Discussion. The “above-board” method, 
telling the involuntary client that he is a client 
and should behave as one, is a crude method at 
best. It can be understood as an attempt by 
the therapist to support the involuntary 
client’s own motivation toward help by arous- 
ing motivating anxiety. Such a method would 
seem clearly to contradict the basic tenets of 
client-centered therapy. And yet, in a search 
for appropriate ways of dealing with various 
clients, the therapist’s active participation, his 
attempt to arouse motivating anxiety which 
will help the involuntary client to become more 
ready to accept therapy, may very easily have a 
place in special cases. Case 1, perhaps, is not a 
good case in question. It can be argued that the 
therapist had a preconceived notion as to what 
parents should talk about and his relationship 
developed more readily when the parents ac- 
quiesced. Case 3, on the other hand, seems to 
be more specifically suited to this approach 
and therapists who have worked with schizo- 
phrenic patients may have had occasion to 
wonder about this problem. It is here, with 
special clients, that client-centered therapy will 
have to expand its views. Again some obvious 
questions come to mind: Which are the best 
methods to arouse just enough anxiety so that 
it is motivating and yet does not serve to ter- 
minate the contact (Case 2) ? When is anxiety 
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arousal the most adequate pretherapeutic pro- 
cedure? How, precisely, is the therapist’s 
challenge perceived by the client? If we have 
even tentative answers to some of these ques- 
tions, the therapist may be in a position to 
differentiate his clients’ needs more clearly. 


Concluding Statement 


It is probably true that we will never have 
a science of psychotherapy which takes all 
variables into account. Each client, and each 
therapist, is an individual in his own right, and 
not too consistent an individual at that. 
What we can hope for, at best, is a more 
thorough understanding of the therapeutic 
process and this by way of gathering many 
observations. Such observations may indicate 
general trends which hold true for many 
clients and many therapists. With regard to an 
evaluation of a pretherapeutic procedure de- 
signed to assist the involuntary client in be- 
coming a motivated client, we face the very 
same difficulties. We will never know if a 
given approach was just the right one with a 
given client, or if another approach would 
have worked as well or even more effectively. 
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We can, however, make it our practice to ob- 
serve closely, to report and evaluate our ob- 
servations, and last but not least, become psy- 
chologists who ask the right questions. 
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Client-Centered Therapy with Parents 
Clark E. Moustakas and Greta Makowsky’ 


The Merrill-Palmer School, Detroit 


In recent years the client-centered approach 
has been used in play therapy with emotionally 
disturbed children, retarded children, and nor- 
mal children who have situational problems ; 
in group therapy with children and adults; in 
work with physically handicapped children; 
and in work with neurotic and _ psychotic 
adults. In each of these uses counselors have 
found it necessary to develop particular tech- 
niques and principles, making some modifica- 
tions of the basic approach while still adhering 
essentially to a client-centered philosophy. 

However, little attention has been given 
thus far to the use of client-centered counsel- 
ing with parents who are primarily seeking 
help for their children. Our purpose is to 
present some problems we have met us client- 
centered counselors working with parents and 
some tentative suggestions, based on our ex- 
perience, as to ways of handling these difficul- 
ties. 

The cases presented are from the records 
of a psychological counseling service which in- 
corporates psychoanalytically-oriented and non- 
directive approaches and offers informational, 
advisory, diagnostic, and treatment services to 
children, adults, and families of a large metro- 
politan community. Some parents are referred 
by various community agencies and professional 
people; others come directly, self-motivated. 
Only cases in which a client-centered philoso- 
phy was maintained are used in this discussion. 

Parents bring a variety of children’s prob- 
lems to the service. Before a child is accepted 
for nondirective piay therapy, a client-centered 
counselor sees one or both parents, so that the 
problem can be described and arrangements 
made for the child. These parent contacts may 


1The authors, respectively a psychologist and a 
graduate student and assistant at the Merrill- 
Palmer School, wish to acknowledge the assistance 
of Dorothy L. Tyler, Editor of Publications, in pre- 
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or may not continue as the child proceeds 
through therapy. The parents have the option 
of seeing a nondirective counselor on a regular 
or intermittent basis, according to their felt 
needs. 


Problems in Counseling Parents 


Certain problems persistently recur in our 
counseling with parents. Experience has shown 
that optimum results in client-centered coun- 
seling are generally obtained with clients who 
see their problems as focused in themselves and 
who are motivated to obtain help for them- 
selves. Further, this approach has been most 
successful with clients who are neither aggres- 
sively dependent upon the therapist nor un- 
willing to assume responsibility in solving their 
own difficulties. Many parents who come to a 
family counseling service do not see the child’s 
difficulties as a reflection of their own conflicts 
and confusions, or, if they do, their perception 
is often vague and ill-defined or purely in- 
tellectual. They see the problem as the child’s, 
not their own. Moreover, they come to the 
counselor with the express purpose of obtain- 
ing specific, authoritative advice from him 
about “the right way” to handle their child- 
ren in certain perplexing situations, or they 
seek an explanation of how and why they have 
failed. Often they come to the counselor as a 
last resort and are intent on obtaining definite 
guidance from him, with an aggressive de- 
pendency motivated by desperation. Since they 
come with definite attitudes and goals related 
to their children, the counselor’s reflections of 
their feelings, no matter how sensitive and 
empathetic, may be perceived by parents as in- 
adequate and frustrating. 

Such parents often continue to demand di- 
rect advice, and when it is not given, they ter- 
minate the contact, because they feel their real 
needs are not being met. To illustrate, a few 
excerpts are presented from an interview with 
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a parent who clearly experienced the strictly 
nondirective approach as unsatisfactory. 


A Parent Dissatisfied with the 
Nondirective Method 


Mr. J. came to see the therapist about his 
five-year-old son Ernest, who was a withdraw- 
ing, sensitive child, afraid of new situations 
and easily upset. For the first twenty minutes 
Mr. J. related incidents of Ernest’s behavior, 
expecting the therapist to give a diagnosis on 
the basis of the described behavior and offer 
definite suggestions for handling the child, or 
at least a definite therapy plan. When he dis- 
covered that only his feelings were being re- 
sponded to, he began to pressure the therapist. 


C:2 I’m not worried, but I’d like to know about 
certain things we're doing, whether they’re right 
or wrong, and should we attempt to change them. 

T: I can see that that is of great concern to you. 
(Pause) 

C: Specifically, his lack of a desire to make any 
decision at all. He has a fear of new friends and 
new situations. That’s my main concern. He’s very 
quiet. He’s a poor sport, too. Should we always give 
in to him, or should we let him know that he must 
lose at things sometimes? That’s what I want to 
know. 

T: You see those aspects as problems and feel 
very unsure about what to do, is that it? 

C: I don’t know if we’re really seeing anything. 
Would you like to ask me some questions now, or 
what? 

T: You don’t feel as if we're really getting any- 
where this way? 

(After a long pause the client then related more 
instances of the child’s troubling behavior and the 
therapist reflected the feelings involved. Again they 
came to an impasse.) 

C: One of the things I want to ask your advice 
on is: should we let him go his own way or should 
we correct him? ... Well, am I supposed to ask 
you now what to do, or do we just go on like this? 
What do we do in a situation like the one I told 
you about? 

T: There are really no right or wrong answers 
to these things—we can just explore your feelings 
about the situation and perhaps work out some 
solutions together. (Pause) I gather that you feel, 
at this point, that you can’t handle it alone any 
more, that you’re very unsure about your next 
move. 

(Again there was a period when the father illus- 
trated the child’s behavior with still more incidents, 
stressing the seriousness of the problem and his 
need for expert guidance.) 


2C, client; T, therapist. 


C: I don’t know whether you have a picture of 
the child or not. We would like to know whether 
we should bring him out more, make him more of 
a social being to live in the world, or let him re- 
main as introverted as he is. 

T: You feel there might be something that should 
be done to change him, that if he were more out- 
going life might be easier for him. 

C: Oh, he could live fairly happily the way he’s 
going. .. . But should we force him to meet new 
situations, or shouldn’t we? That’s the question. It’s 
just a problem of how to attack this—I don’t know! 
For instance, should we try to force him to go to 
nursery school when he gets all dressed up for it 
and then refuses to get on the bus? Or should we 
just let him go his own way? My wife feels we 
shouldn’t coerce him, and I’ve always deferred to 
her—but we want your opinion, Which is better to 
do? 

T: Evidently you feel it might be better to co- 
erce him? 

C: I do feel that, but maybe my professional 
training is influencing me, because in my job you 
often do have to be firm. But I would like you to 
tell me what is really right to do. For instance, when 
I try to play with both children, each wants to do 
different kinds of things, so it usually ends up with 
one of them mad. Now how would one handle that? 

T: It is important to you to satisfy both of them 
and have them both happy, is that it? 

C: Yes, and we would like some suggestions. 

T: I know you would like me to be able to give 
you some definite answers and tell you what to 
do, but I can’t really do that. We can continue to 
look at the problem in all its aspects and perhaps 
work out something. 

C: Well, I don’t know as you've said anything 
yet. Are we expecting too much of him? Can’t you 
give us an answer? Specifically, shall we defer to 
him in competitive games? Shall we let him win 
in order to give him confidence, or what? 


Near the end of the interview the therapist 
explained the possibility of play therapy for 
the child. Here again Mr. J. wanted an opin- 
ion as to whether or not play therapy would 
definitely help Ernest, and whether he and his 
wife would be helped by any further inter- 
views. He left without making any decision 
and did not return for further interviews. He 
expressed the opinion that he had wasted his 
time in coming to the service. Obviously the 
kind of counseling offered was not meeting his 
needs, as he perceived them, in any way. 


Results of Modifying the Nondirective 
Method 


After a number of such experiences one of 
the counselors decided to try veering from a 
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strictly client-centered approach with one 
mother who was dissatisfied with the inter- 
views. The change was made in the last few 
minutes of the third interview, and its effect 
upon the client was striking. Mrs. L. had 
spent most of the hour up to this point describ- 
ing her son’s hyperactivity and disobedience. 


C: He’ll get worse and worse. He needs to be set 
right. How he got wrong I don’t know. We didn’t 
do it intentionally. I think we must have made 
some mistakes and he is on the wrong road and we 
can’t reach him and help him. 

T: The biggest problem then is trying to set him 
in the right direction, I see that our time is about 
up. Would you like to make another appointment? 

C: No, I don’t think so. Not if this is the way 
we are going to operate. I can see no point in com- 
ing here and just telling you all the things he has 
done unless you are willing to discuss them with 
me. For instance, we insist that he tie his own 
shoes when he asks us to do it. Other children his 
age tie their shoes. What I would like you to do is 
tell me whether that is right or wrong and how 
we should change. 

T: Yes, I understand it would be much easier if 
I could tell you what was right and wrong in your 
relations with Fred, but I don’t have any specific 
answers. 

C: But you could tell we what you think. What 
about the shoe business? 

T: Well, I don’t believe that in itself it is 
either right or wrong to insist that he tie them. I 
would rather look at it in a different way. For in- 
stance, perhaps Fred is trying to reach you and 
your husband in some way so that the shoe busi- 
ness is really an expression of some other need. 

C: You mean he might be asking for something 
else? 

T: He might be reaching out for some sort of 
security in his relationship with you. 

C: Then how could I help understand that bet- 
ter? 

T: Well, one way you might do that is by try- 
ing to respond more to his feeling about things, 
trying to show him that you understand how he 
feels inside. 

C: You mean that when he asks us to tie his 
shoes we could try to discover what is behind it. 
Well, he sometimes says, “Oh, I don’t want to do 
it, it’s too much trouble.” Do you mean it would 
be better to say to him, “It is a very hard thing 
to tie your own shoes”? 

T: That is one thing you might say. 

C: Then he would expect us to tie them all the 
time. 

T: Since you feel that you shouldn’t tie them, 
perhaps you could still respond to his feelings in 
some way, but insist that this was one of the 
things that he ought to do for himself. 

C: Well, I know that the time is up. I would 
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like to come in again. These last few minutes have 
been more helpful to me than anything you have 
said. My husband feels that way too. We think it 
would be more helpful to us if we could talk about 
things in just this way with you. 

T: Something like this was what you really 
wanted, and it has been much more satisfying to 
you. 

C: Yes, it definitely has. 


Essentials of the Altered Approach 


Just how did the therapist alter his original 
approach in this contact and those that 
followed? He maintained the basic attitudes of 
client-centered counseling; he reflected and 
clarified the client’s feelings. He did not at- 
tempt to persuade. He did not interpret the 
child’s behavior or the mother’s feeling about 
it according to some external system of psy- 
chological concepts. He did not diagnose or 
attempt to deal with “resistances” and “de- 
fenses.” He kept the focus of the problem 
where the client placed it—in the child. How- 
ever, he did discuss with the mother incidents 
of the child’s behavior which she wanted to 
know how to handle. He did give some in- 
formation about children’s behavior and offer 
possibilities for action which the parent was 
completely free to evaluate in her own way. 
He widened the client’s perceptual field in the 
situation by offering her views other than 
those she had seen, but he accepted any decision 
she made or any feeling she had about them. 

To illustrate more clearly how the altered 
approach operates, excerpts are presented from 
an interview with Mrs. F., who had previously 
seen a strictly nondirective counselor and had 
been so dissatisfied that she had asked for a 
different one. The second counselor tried the 


changed approach. The interview proceeded as 
follows: 


C: Before I start talking to you about Ronald, 
there are a few things I’m concerned with. The 
person I talked with before was passive and made 
me feel insecure. Every time I asked her something 
she’d just go back to the way I felt about it. Well, 
I came here to get some help and when I asked 
questions I wanted some explanation from her, 
some of her thinking. I realize it might be helpfal 
to have someone who’s objective listen, but I want 
more than that. I don’t expect to be given flat an- 
swers to questions, but I do expect some ideas to 
be offered to me. 

(She then proceeded to describe Ronald’s diffi- 
culties. The therapist responded here to the feelings 
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she expressed. Following this Mrs. F. 
questions. ) 


raised some 


C: There are two things we would like to know. 
First, is his speech normal, or should I take him 
to a speech clinic? We're unable to reach him com- 
pletely, for some reason. Maybe he’s been hurt at 
some time, and insecure. How can 
we help him? Can you answer these questions? 

T: I really can’t say whether Ronald should go 
to a speech clinic or not. There are a number of 
possible explanations for speech retardation. There 
may be maturational involved. You have 
mentioned that he’s advanced in other areas, and 
it may be that in time he'll catch up in his speech 
without any special help. Another possibility is that 


He’s sensitive 


factors 


he may be bothered by some troubling feelings and 
ideas and that these are blocking his speech devel- 
opment. A third possibility might be that he has in- 
sufficient experience in listening and talking to peo- 
ple. Another possibility might be that he is pretty 
much operating at his potential level of perform- 
ance. 

C: Well, that is helpful, to look at it in those 
different ways. I don’t think it’s his level, because 
he’s so much brighter than that in solving puzzles 
and understanding stories. 

T: You're pretty sure then it’s not that? 

C: Yes, I think so. It also couldn’t be that he’s 
had insufficient practice because we talk to him a 
lot at home and give him plenty of opportunity to 
express himself. I'm not sure which of the other 
two it is. It may be a little of both, I’ve noticed that 
he’s very afraid of new situations and new people, 
so that there some emotional factors that 
are not too clear to us. What about those? 


may be 


T: We could explore some possibilities. It’s true 
that some infants from a very early age show cer- 
tain qualities that seem to continue in spite of ex- 
perience. 


C: Well, I’ve always noticed that Ronald has 
been more sensitive than most other children. I’ve 
always wanted him to be outgoing, so maybe I’ve 
pushed him that way. Maybe he’s just not an ex- 
trovertive type of child. I should accept him as he 
is. Maybe that’s the reason he’s insecure—because 
he can’t accept himself as that kind of child. 

T: That is another possibility—that he might be 
insecure because there are certain things about him- 
self that he has not accepted, or perhaps doesn’t un- 
derstand. 


C: You know, I had almost decided not to come 
to this place any more. I’m glad I did, because this 
has been most worth while. As for Ronald, I think 
I will just bring him down for play therapy to see 
if anything can be done. If it doesn’t change him, 
well, it’s all right anyway. 

T: You feel that you might be able to accept him 
as he is if he should remain essentially the same. 

C: Yes, looking at it that way. 


io ~ 
- 


It is clear that in this interview the client 
saw the therapist as a helpful person with 
whom she could discuss things, and that she 
gained some insight into her difficulty in ac- 
cepting the child and the ways this difficulty 
might be affecting him. 


Adapting the Approach to Individuals 


Many cases can be handled with less alter- 
ation of the basic approach. There may be 
occasions, in cases handled in the regular 
client-centered manner, when it is important to 
respond directly to parents’ questions about the 
child’s progress or to reassure them. The in- 
dividual parent and his problems must deter- 
mine how far and in what way the counselor 
will veer from the nondirective approach. Some 
parents begin by focusing the problem in their 
children, but in the security of a client-centered 
atmosphere go on to explore their own emo- 
tional experiences. Such parents can be helped 
most if a client-centered relationship is main- 
tained. 


‘Tentative Principles of Parent Counseling 

The following tentative principles for client 
centered counseling with parents are presented 
as a result of the experiences discussed in this 
paper. 


1. When the parent can see the difficulties 
as focused in himself and his own feelings, the 
traditional nondirective approach should be 
maintained. 


2. Some parents find a strictly client-cen- 
tered approach frustrating and inadequate. For 
them the approach may be modified as follows: 
The therapist may present sound child develop- 
ment information in answer to direct questions, 
leaving the evaluation of such information 
and decisions to act upon it completely to the 
client. When the parent raises questions, the 
therapist should offer a tentative explanation 
or description of the child’s progress, without 
giving interpersonal factors at any time. Occa- 
sionally the therapist may be supportive when 
the client seems urgently to need support in 
order to relate more adequately to his child. 
The therapist avoids becoming purely educa- 
tive. He presents information as tentative 
rather than authoritative. Moreover, he makes 
no value judgments about the information 
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or how the parent should use it. He also avoids 
becoming analytical and external in his view 
of the client. He does not make diagnoses or 
interpretations, but follows completely the 
client’s percepts of the problem. 


3. In all cases basic client-centered philoso- 


phy should be maintained in constant sensi- 
tivity and response to the parent’s feelings and 
confidence in his ability to make his own de- 
cisions and work out satisfactory solutions to 
his difficulties. 


Received February 25, 1952. 














On the Nature of Psychotherapeutic Interpretation 
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A psychotherapist is presented with but 
three things with which to work: (a) verbal 
productions, that is, the content of the client’s 
or patient’s remarks; (4) gestural productions, 
that is, the tone of voice, facial expressions and 
other bodily gestures; and (c) continuities and 
discontinuities of these productions from one 
part of an interview to the next and from one 
interview to another. 


With regard to the significance of these pro- 
duction-continuities, the therapist engages in 
two distinguishable activities. First, he con- 
strues the production as having a certain sig- 
nificance or meaning. This he does in his own 
mind and as a product of his own training, 
value system, and frame of reference. This ac- 
tivity we may call interpretation qua construc- 
tion. Second, he expresses to the client or 
patient that construction which is best calcu- 
lated by the therapist to be of help to the client 
or patient. This activity we may call interpre- 
tation qua expression. 

The greatest concern among psychotherapists 
to date has been with interpretation qua con- 
struction. The concern has varied among such 
questions as whether dream symbols have 
standard meanings, whether some parts of a 
dream have direct referents and other parts 
symbolic referents, whether dreams represent 
old wishes or preparations for coming strug- 
gles, whether unconscious referents should ever 
be involved at all. The concern over the “real” 
significance of a production is still a major one. 
The bases for construing meaning still vary 
widely. Later in this paper it will be proposed 
that even though the efforts after meaning 
vary widely, they nevertheless impinge upon a 
common element. 

As to interpretation qua expression there has 
been a great deal of concern regarding the 
form of the expression. Put in another way, 
therapists have been cognizant that the way in 


which an interpretation is “put” to the client 
or patient will have a definite effect upon the 
nature of the reaction to the interpretation. As 
a consequence, stress has been placed upon the 
“dosage,” that is, the amount of interpretation. 
Also, criteria of timing of interpretation have 
been developed based upon the analogy of the 
presenting symptoms in childbirth. Further 
evidence of this concern is in two researches 
which have been addressed to the immediate 
effect of interpretation as found in phonograph- 
ically recorded interviews. One study, done 
by Bergman [1] indicated, among other find- 
ings, that when nondirective therapists inter- 
preted in response to client’s requests for eval- 
uations of some sort, the consequence was a 
significant drop in subsequent self-exploratory 
activity. A second study, done by Rickard [4], 
indicated that in a sampling of Freudian, Ad- 
lerian, Horneyian, and Rogerian interviews in- 
terpretation was followed by resistance with 
significantly greater than chance frequency 
and noninterpretive statements were followed 
by resistance with less than chance frequency. 
In both of these studies the client-centered “re- 
flection of feeling” was not treated as an inter- 
pretation even though it involved the act of 
construing and the act of expressing a con- 
struction. 

There are many who have insisted that “‘re- 
flection of feeling” is thoroughly interpretive. 
There are many who have insisted that it is 
not. Why should there be this difference? The 
writer contends the reason is due to a con- 
fusion between the two aspects of interpreta- 
tion, as defined here, and two other factors 
which invariably accompany interpretation. 
These two factors are the intent of the thera- 
pist and the origin of the meaning construed. 
Client-centered therapists have denied they 
were interpreting, not because they did not 
construe meaning nor express constructions but 
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because they made every effort to stay with the 
meaning originated by the client and because 
they made every effort to understand, appreci- 
ate, and accept the client’s proferred meaning 
rather than to lead, guide, suggest, open up, or 
otherwise teach or add meaning to what the 
client produced. One recent book in the non- 
directive orientation is mainly devoted to the 
development of skill in the construction of 
meaning and in the expression of the construc- 
tion in a manner consistent with the aims of 
client-centered therapy [3]. It is clearly evi- 
dent that the concern over interpretation qua 
expression is as ubiquitous as is the concern 
over interpretation qua construction. 

The writer suggests that interpretations qua 
constructions fall into five classes. The follow- 
ing are given in illustration. 

First, the meaning in a given production may 
be construed as of “symbolic” significance, that 
is, a production or an element of a production 
may be construed as symbolizing a person, 
event, condition. Such 
“symbolic” constructions are frequently made 
in connection with dreams, fantasies, and slips 
of the tongue. 


feeling, process, or 


In a first interview a man suddenly 
stopped his flow of conversation to say, “Funny, a 
picture just flashed into my mind, a picture of a 
woman being cut in two by an airplane propeller. 
That’s odd.” He then continued with his former 
topics. His therapist was a woman. The symboli- 
zation of hostility toward the therapist was con- 
strued but in this case was not expressed. 


young 


Second, the meaning in a given production 
may be construed as of “representational” sig- 
nificance, that is, the production may be con- 
strued as representing but not describing 
certain attitudes. 


A young veteran had sought aid from physicians 
for a period of a year in connection with various 
ailments. He was eventually sent to a mental hy- 
giene clinic. During the opening interview he 
seemed ill at ease and seemed to have little con- 
ception of why he was there. Four times during the 
course of the hour he brought up stories apparently 
unrelated to the therapy or to each other. He re- 
lated how a buddy of his fighting in Korea had 
just been court-martialed for going the wrong way. 
It seems one officer told him to go this way, then 
two minutes later another officer told him to go 
that way and then the MP’s picked him up. A 
little while later he was suddenly reminded of an- 
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other buddy, who was engaged to a girl and about 
to get married. They set up a joint bank account, 
One later the bank 
another had His other stories car- 
ried the same theme of “he who trusts others will 
be destroyed.” 


week girl, the account and 


man vanished. 


These stories seemed to represent 


quite well the apparent attitude of the man toward 
his therapist. 


A third and more complex type of situation 
may lead to construing a production as of 
“parallel” significance, that is, the production 
may be construed as directly descriptive of 
attitudes or feelings previously expressed or 
acted In the illustration to follow the 
parallel exists in the client’s verbalization of 
attitudes toward his wife which the client had 
acted out toward the therapist. 


out. 


In the third interview and again in the fourth, 
fifth, and sixth interviews the client “bummed” the 
therapist for that he 
trying to quit smoking but that coming in for inter- 
vie ws 


a cigarette. He stated was 


made him rather tense and the cigarettes 


would surely help. Several interviews later he de- 


cided he wanted to see the tests he had taken 4s 
part of a research program. His request had been 
turned down by the psychometrist because their 
extensive nature would have required too much 


time to make copies. The client sought desperately 
and attackingly to induce the therapist to inter- 
vene with the psychometrist. A few interviews 
later the client asked to have his interviews extend- 
ed from one-hour periods to hour-and-a-half pe- 
riods. On the day of the eighteenth interview the 
client appeared a minute or two late. Just as he 
entered the office he said, “Boy, have I been busy 
today. I worked right up to the last minute and 
didn’t have time for my usual cup of coffee. Is 
there a cup of coffee around the joint?” During 
the material which followed the client spoke of 
how his wife had so little opportunity to get out 
of the house for the purpose of developing her own 
interests and from time to time he offered to 
work at home and baby-sit. Then he said, “And 
do you know what she does? She takes advantage 
of the offer!” Although it had for him a humorous 
aspect he emphasized his feeling of being taken 
advantage of. He continued, “Maybe, if she went 


how 


out of her way to do something special for me 
then, maybe I wouldn’t feel so put upon. Even 
maybe if she did something special two or three 
times I would feel all right about it.” At this point 
the therapist raised the question of whether it made 
sense to think of himself as from time to time re- 
lating to people by fighting, asking, and giving for 
the more obscure purpose of seeking reassurance of 
their affection for him or their positive regard for 
him. His apparent giving to his wife for the pur- 
pose of getting special treatment in return, and his 
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requests for cigarettes, longer hours, intervention 
in his behalf, and coffee, for the purpose of evok- 
ing gestures of regard were all suggested. The 
parallelism was construed and the construction ex- 


pressed. 


A fourth class of construction may be seen 
when the meaning construed is of the “imme- 
diate experience” of the client or patient, an 
experience which the person is having but 
which is not being directly described. 
twenty-sixth in- 
terview states, “If this therapy is going to get any- 


A young woman student in a 
where there are some things which I know I should 
bring up and yet, I find it 
I should yet I just can’t.” The 
therapist remarked, “Maybe you are afraid of how 


just about impossible 


to bring them up 


I might judge you, how I might feel about you.” 
“Well, yes and no,” she replied. The therapist then 
went on to say, “Look, you've seen me enough now 
and from your studies in this area you know very 
that 


my finger at you or 


well I’m not going to moralize with you or 
shake express any shock at 
what you might say. What I mean is that maybe 


you are afraid of how I might feel about you be- 


hind all this, how I might feel about you person- 


ally rather than professionally.” Here the therapist 


construed her productions in terms of an imme- 


diate experience which the woman was undergoing 
but not expressing directly in the sense of describ- 
ing directly, 


A fifth and last class of construction is a 
construction relative to the “direct assertion” 
inherent in a production. It is this class of con- 
struction with which nondirective therapists 
have in the past dealt in largest part. In the 
case of the young woman student just cited 
there is the direct assertion of a feeling of con- 
flict between what she knows she ought to do 
and a strong reluctance to so do. 

The examples given illustrate two things. 
First, that interpretations qua constructions 
may be ordered in five classes and, second, that 
the productions of clients or patients may be 
thought of as being in five comparable classes. 
This might suggest to some that interpreta- 
tions qua constructions should be geared to the 
form of the production, that a dream, for ex- 
ample, should be construed for its symbolic 
aspect and an immediate assertion construed 
for its assertional aspect. There is another way 
of looking at this, however. It seems very like- 
ly that every client or patient production has 
all five aspects and hence may be construed as 
having meaning in any one or in all five con- 


345 


struction classes. ‘The young man who had the 
the bits 
by the airplane propeller clearly was asserting 


fantasy of woman being cut to 
his feeling of how odd, how peculiar it was 
that the 


versely, immediate assertions may also be con 


fantasy should have occurred. Con- 


strued as having symbolic meaning. 


At the outset of a fortieth interview the therapist 
pointed out to a young man that in nearly every 
had 


before. The therapist raised the question of wheth 


interview he began by reviewing what gone 


er or not this had any significance in the client's 


relationship to the therapist. During the client's 
explorations for significance it developed that 
reviewing each time he always came upon “miss 


ing spots in my knowledge of the building blocks 
built.” He 
nature of his 


with which my personality is went on 


to discover the defensive behavior, 


his manner of acting which was designed to pre 


vent the therapist or any one else from controlling 
the conversation, and indirectly, from controlling 
him. Sometime later the phonographic recording of 
this hour was being played to a group of clinician 

why the 


One of them, in an effort to understand 


therapist operated as he did, raised this question 
“Did it occur to you at all to point out to him that 
he had blocks’ 


blocks can be used walls of 


term ‘building and that 


to build 
that is what he was doing, defending himself from 


used the 


defense 4 that 


outside attack?” Here a part of the production was 


construed for its symbolic significance, a signifi 
cance in this case which the client came to in di 
rectly assertional material 


It is suggested that were we to study differ- 
ent therapists and therapists of known train- 
ing frames of references we would find charac- 
teristic patterns in the use of favored classes 
of interpretations qua constructions. Prior re- 
search on counselor behavior, behavior con- 
sidered as types of activities regardless of the 
topical content, has shown quite stable patterns 
in the use of techniques from one interviewee 
to the next, from one interview to the next, 
and from one counselor to the next in a given 
school of training [2]. This study, however, 
took no account of the relationship between 
what the client expressed and the therapist’s 
response thereto. 

It seems to be a matter of general agreement 
that a given attitudinal state may be expressed 
in many ways. For example, hostility toward 
a therapist may very well appear in terms of 
a destructive dream or fantasy, or in terms of 
apparently unrelated memories which carry a 
hostile theme toward a significant social figure, 
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or in terms of strong feelings against a person 
known to be related in some way to the thera 
pist, or in terms of an irritation with some 
thing the therapist does, or in terms of a direct 
expression of feelings against the therapist. 
As long as the interpretation qua construc 
tion relates accurately to the attitudinal state 
being expressed in the production, | believe 
therapy will take place, although the type of 
development and degree of development may 
differ with the type of construction expressed 
and the stage of the client's or patient's famili 
arity with the mode of communication. This 
is what I meant when I said earlier that efforts 
after meaning, even though apparently quite 
different, might nevertheless impinge upon a 
common element. I further believe that re 
search directed at relating the interpretation 
qua constructions which are expressed to the 


If, Porter, Jr 


subsequent activity of client or patient will 
give us information as to how each class of in 
terpretation influences the speed with which 
and the channels in which the conceptual and 
affective reorganization we call therapy takes 
place. 
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Some Characteristics of Clinical Judgment’ 


Bertram RK. Forer and Ruth 8. Tolman 


Veterans Administration Mental Hygiene Clinic 


Los Angeles, California 


In the present stage of development of clini 
cal psychology, the study of the psychologist 
himself and of his judgments needs to go hand 
in hand with the study of his instruments and 
methods. It is appropriate that we should be 
constantly concerned with the validity and re 
liability of clinical instruments. At the same 
time, and especially in the case of projective 
methods, consideration of the nature and deter 
minants of the judgments and procedures of 
clinicians can be instructive. While we are at- 
tempting to determine whether clinical judg 
ments are right or wrong, we need to pay at 
tention to what clinical judgments are like, 
to the kinds of information that contribute 
most substantially to them, and to the influence 
of personal bias and need on the part of the 
clinician. A related point which needs explor- 
ing is the kind of feelings which the clinician 
has about his clinical activities, in particular 
the degree of confidence or certainty which he 
experiences. We know very little about the 
clinical psychologist’s feelings as he makes his 
judgments or about the kinds of information 
he employs with confidence or with misgivings. 

Such study of the clinical psychologist him- 
self has only begun. A brave start has been 
made by Kelly and Fiske [4, 5] who are trying 
to devise methods to differentiate between po- 
tentially “good” and “bad” clinical psycholo- 
gists at an early stage in training. Related data 
on the subject are appearing, such as Anne 
Roe’s study [7] of the psychologist among the 
scientists and the clinician among the psycholo- 
gists. Luft’s comparison [6] of the clinical pre- 
dictions of physical scientists with those of 
members of psychological disciplines yields pro- 


1Reviewed by the Veterans Administration and 
published with the approval of the Chief Medical 
Director. The statements and conclusions published 
by the authors are the result of their own study and 
do not necessarily reflect the opinion or policy of 
the Veterans Administration. 


vocative results, A suggestive study of per vonal 
judgment 
[9|. Other 


along similar lines are under way [1] 


determinants of clinical was per 


formed by Weingarten projects 
Procedure 


‘The 


clinical judgment reported here was the 100 


vehicle used for the investigation of 
item structured sentence completion test des 
cribed in detail elsewhere [3]. In constructing 
the test, several personality areas were eet up in 
advance and were approached by items framed 
in two basic forms: first person and third per 
son. The various areas examined in this study 
were selected from the check sheet which was 
designed for the test. They are summarized as 


follows: 


A. Attitudes 


portant 


Attitudes rel 


interpersonal 


ating ?t vari 
figures in the 


patients fe 


father, mother, males, females, people, authorit 


B. Dominant needs or drives 


Kinds 


which provoke the patient’s aggression, anxiety 


C, Causes of environmental conditions 
pression, failure, guilt. 

D. Reactions 
patient reacts (or thinks of reacting) to a varie 


_ 


Characteristic ways in which the 


of interpersonal relationships: aggression, rejection 


failure, responsibility, sex, love, and marriage 


A memorandum was sent to all clinical psy- 
chology staff members and advanced trainees 
at three local installations of the Veterans Ad- 
ministration. The memorandum described our 
research interest and requested them to make 
a priori judgments about the potential clinical 
value of each item in the sentence completion 
test and to describe how certain they felt of 
each value judgment according to the follow- 
ing rating scale: 


A. From the character of this item, how helpful 
would you expect it to be in contributing to your 
clinical judgment of a patient? Indicate by check. 
(1) Will make little or no contribution to 
clinical judgment. 
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(2) Will 
clinical judgment 
(4) Will make considerable 
clinical judgment 
Bh. How much certainty do you feel 
rating you have made in A? 
(1) Definitely 
(2) Reasonably 
(4) Very 


make moderate contribution to 


contribution to 
about the 


uncertain. 

certain 

certain 

Accompanying this request for ratings was a 
blank copy of the sentence completion test. Re 


plies were received from 30 psychologists.” 
Questions 


1. ‘The first problem we wished to consider 
was the clinician's preconceptions as to the 
and third 
person items. Do clinical psychologists expect 


comparative clinical value of first 


one of these two types of item to provide a 
better avenue of clinical projection than the 
other, hence a better stimulus for eliciting at 
titudes ? 

2. The next problem was to describe the 
clinician’s beliefs regarding the comparative 
clinical usefulness of items dealing with differ 
ent kinds of content. Will items in some con 
tent areas be judged more likely to provide 
valuable clinical 
other areas? 


than items in 
It was our belief that certain 
areas, such as attitudes toward parents and 
reactions to aggression, would be considered 
the best sources of clinical data. 


information 


3. The most interesting cluster of questions 
dealt with the problem of the clinician’s 
certainty or confidence in forming clinical 
judgments. Are clinical psychologists more 
confident in making some judgments than in 
making others, and if so, is it possible to dis- 
cover some of the factors affecting this vari- 
ation in confidence? Does confidence appear 
to vary in relation to objective features of the 
test material, such as first- or third-person 
formulations? Does it vary consistently among 
clinicians, so that one individual is habitually 
certain of his judgments, another uncertain? 
Does it vary in relation to agreement or dis- 
agreement with majority opinion in the group 
of clinicians? Or does it vary with the charac- 
ter of the judgment made, such as the ex- 

2Some weeks later a protocol filled out by one 
patient was sent to the same clinical psychologists 
accompanied by the same rating scale. Judgments 


on the completed protocol were received from 25 
psychologists. 
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tremeness or moderation of the judgment, or 
the amount of value ascribed to an item / 
KMesults 
| Preconceptions 44 to comparatwe clinical 


Dietri 


butions of ratings on first-person items within 


walue of firat- and third-person items 
each content area were compared a8 a group 
with those of third-person items in the same 
area, Chi squares were computed to estimate 
On the blank form 
of the test only one difference reached the 05 
level of 


Since this 


significances of differences 
ignificance in ZZ teste of significance. 


cannot be considered a significant 
difference 


difference 


It would seem that whatever conceptual system 


our original hypothesis that real 


would obtain was not supported, 


this group of clinicians used for evaluating the 
( first 


was of no general signifi 


item, the form in which it was framed 


or third person ) 


cance,” 


2. Comparatwe clinical usefulness of tlems 


in different content areas. In order to dis 


cover which areas of personality information 


(attitudes toward interpersonal figures, domi 


nant needs or drives, causes of feelings, or 


reactions) were believed to be the most clini 


cally fruitful, chi-square comparisons were 
made between the distributions of usefulness 
ratings (value ratings) in the various areas 


First 
for these comparisons. Table 1 gives 
the results. 


of content and third-person iterns were 


combined 


From these comparisons it is clear that clini- 
cians hope to get more useful clinical material 
from items dealing with causes of emotional 
states and from items dealing with reactions 
toward specified situations than from items 
dealing with interpersonal attitudes. 

®The same procedure was followed for ratings 


of the « 


yne completed protocol. While there was no 
significant difference between distributions of all 
first- and all third-person items, some of the first- 


person items were preferred t& 
within an area. The opposite was true for 
of items in other areas. These results are in 
tial agreement with those of Sacks [3] 
[2]. Although no attempt has been made to inves- 
tigate systematically with many completed protocols 
the areas in which items are preferred in first or 
in third person, our findings suggest that the pres- 
ence of concrete clinical material does alter a priori 
judgment. It would seem reasonable that the pat- 
tern of preference might change with the specific 
completions provided by different patients. But on 
this point we have no data. 


} rar . ’eT 
thira-per™ iteTns 
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‘Table | 


Chi-Square Values for Dillerences between Distri 


butions of Value Judgments among 
Content Areas 
Content 
Area Mean (Causes Drives Reactions 
Causes 2166 
Drives Z.14/ 47451 
Reactions 1146 A AYA 1490 
Attitudes 2.025 14,452°9° 4,208 14.672°* 
** pe ol 
ee p* ool 


However, within a content area there were 
clinical value 


ascribed to the different groups of items, Al 


important differences in the 
though items dealing with attitudes toward 
interpersonal figures considered as a group 
were rated significantly the lowest among the 
areas, nevertheless items dealing with attitudes 
toward father were considered the most valu 
able of all items within that area and in the 
entire test, as is seen by a comparison of the 
means in Tables 2, 3, and 4. Items concerning 
attitudes 
least valuable. 

‘Tables 3 and 4 show similar differences, 
though less significant ones, within the other 
two content areas. 


toward males were considered the 


3. Confidence in clinical judg- 


ments. Examination of degree of certainty 


forming 
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judgment, specifically the variation with (1) 
the degree to which one’s judgment agrees 
with or deviates from group opinion about an 
item, (ii) the extremeness or moderation of 
the judgment, and (i) the amount of value 
ascribed to an item. 

a. Gonfidence in relation to objectwe fea 
tures of the test materials, ‘lwo features were 
considered in this inquiry: the relationship be 


tween degree of certainty and the casting of 


the item in first- or third-person form; and 
the position of location of items in the test 
series, 

‘Table 3 
Chi-Square Values for Differences between Distri 


butions of 


Value 


dealing with Causes of 


Judge nts among Items 


Feelings 


Mean Anxiety Guilt Ager Depr 

Anx 2.194 
Cruilt 2.176 0.694 
Ager 2.140 0.617 0.484 
Depr 2.056 10.577** 6.341* 7.79%" 
Fail 2.016 9.1143* 962 5.462 0.941 

*p< 06 

pn < 0! 

lable 4 

Chi-Square Values for Differences between Distri 


Value 


dealing with Reactions to Situations 


butions of Judgments among Items 














: . Mean Sex Aggr. Fail Rej. Respons, 
(confidence ratings ) was made with three 
: : : . Se 2.192 
problems in mind: (a) relationships between A : ; a sone 
‘ ; . . Agger 2.176 6, 
certainty ratings and objective features of the pai 2056 2.995 4.361 
test material; (4) the possibility of consistent Re; 2048 4.386 4.362 0.029 
individual differences in confidence among Resp. 2.032 4.666 7.160% 0.884 1.097 
clinicians; and (c) relationships between de- Love 2.008 5.370 5.889 0.406 0.378 0.503 
gree of certainty and various aspects of clinical * p< 06 
Table 2 
Chi-Square Values for Differences between Distributions of Value Judgments among 
Items dealing with Interpersonal Attitudes 
Attitudes 
toward Mean Father Mother Authority Females People 
Father ————— 
Mother a de: 2.160 4.483 
Authority = ending ae 18.729*** 5.617 
Females ee 28.320°** 12.473** 5.301 
People jecingminiminimntiiiy Geena 29.2679 17.900** 12.804 5.989 
Males 1.727 43.214*°** 27.837%** 19.958*** 10.5079%* 1.851 
** p<.0l 


#99 p< .001 
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First- and third-person items. Comparison 
of confidence ratings in clinical evaluations of 
first- and third-person items (using the blank 
test) yielded only one chi square out of 22 
with a_ probability as low as .05. Any differ- 
ences which appeared could, therefore, have 
arisen by chance and there was no support for 
the hypothesis that either first- or third-person 
formulation is more conducive to confidence 
in clinical judgment. 


Location of items in the test. The possibility 
was considered that confidence may increase 
simply with experience in the judging process ; 
hence items which were located far along in 
the test would be judged with more confidence 
than those in the first part. To evaluate this 
possibility, distributions of confidence ratings 
on successive groups of 25 items were com- 
pared. The data revealed a steady rise in con- 
fidence as the clinicians worked through the 
test. The difference between distributions of 
confidence ratings for the first and last blocks 
of 25 items was significant at the .05 level. 
Thus increased familiarity with the judging 
operation produced a significant rise in confi- 
dence in clinical inferences. 

b. Individual differences in confidence 
ratings. When the confidence ratings of indi- 
vidual raters were tabulated, it became ap- 
parent that the total distributions of ratings 
differed considerably from one clinician to 
another. Scores were computed representative 
of the central tendency of each individual’s 
100 confidence ratings by assigning low ratings 
a value of one, middle ratings a value of two, 
and high ratings a value of three. The total 
values, which we have called individual con- 
fidence levels, were computed for both the 
blank form of the test and independently for 
the single completed protocol, in order to ob- 
tain some approximation of the degree of in- 
dividual consistency. Chi-square comparisons 
revealed the presence of a significant similarity 
between individual confidence levels on the 
two tests (chi square 5.75, » <.02). In other 
words, the general confidence or certainty of a 
clinician appears to have some consistency or 
stability in different situations in which clini- 
cal judgment is required. 


c. Relationship between degree of confidence 
and various aspects of judgment. Three 


Bertram R. Forer and Ruth § 


Tolman 
aspects of judgment were considered here: 
(i) agreement or disagreement with group 
opinion; (ii) extremeness or moderation of 
judgments; and (iii) the amount 
ascribed to an item. 


of value 


(i) To obtain evidence on whether or not 
a clinician’s confidence in his judgments was 
related to the agreement or disagreement of his 
judgments with group opinion, the modal 
value judgment of each item was determined 
and the judgment of each clinician on each 
item was described as agreeing or disagreeing 
with the modal value. Chi square as a measure 
of association between the agreement-disagree- 
ment dichotomy and certainty ratings was 
computed independently for the 30 judges. 
Five judges manifested too little variability 
among their judgments or certainty ratings 
to permit computation of chi square. Of the 25 
individual tests of relationship, 15 
significant chi squares, as shown in Table 5, 
some significant at the .01 level. 


showed 


Table 5 
Frequency of Statistically Significant Relationships 
between Certainty and Value, Extremism, 
and Agreement with Group Judgment 


Agreement 





Cer- No 
Certainty Cer- tain Rela- 
and Valu Extremism tain Dis- tion- 
Agree agree ship 
Certain with Certain—Extreme 0 7 3 
high value Certain—Moderate 2 0 0 
No relationship 0 2 3 
No rela- Certain-Extreme 0 2 1 
tionship Certain—Moderate 1 0 0 
No relationship 1 0 3 


It appears that there are real relationships 
between confidence in one’s judgments and 
the similarity of these judgments to group 
consensus. There are, however, individual 
differences in the direction of the relationship. 
Most of the raters were Jess confident of their 
“conventional” (agreeing) judgments than 
of their “unconventional” (disagreeing) 
ones. Only a minority of the clinicians felt 
more confident in the opposite situation, that is, 
when making judgments in agreement with 
group opinion. The data suggest that most 
clinicians make atypical judgments when they 
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feel certain of their grounds, perhaps when 
their judgments are based upon specific or 
differentiated knowledge or conviction. 


A suggestive clue was found when the in- 
dividual confidence levels were tested against 
the number of items upon which the individ- 
with the group 
modal judgment. A chi square of 2.778 was 
obtained (p between .05 and .10). Though 
this value cannot be taken as significant, it 


ual’s judgments coincided 


suggests that individuals who are generally 
uncertain of their clinical judgments tend to 
make relatively few atypical judgments, and 
that confidence is associated with independence 
in clinical thinking. Futhermore, all four 
judges who were uncertain when their judg- 
ments diverged from the modal values were 
also below the median of individual confidence 
levels. The chi-square value of 4.290 (p <.05) 
indicates significantly lower confidence levels 
among those who were most confident when 
in agreement with judgment than 
among those most confident when in disagree- 
ment with group judgment. The latter group 
can be considered as generally confident and 
independent judges who make “conventional,” 
typical judgments only when they are insecure 
in their inferences. 

(ii) ‘To examine the relationship between 
confidence and extremeness or moderateness of 
judgments, a series of statistical tests was 
made between these two variables. A value 
judgment of one or three represents an ex- 
treme judgment, whereas a judgment of 
two represents a moderate position. Of the 25 
chi-square tests of association between the 
dichotomy of extreme-moderate judgments 
and confidence, 16 were significant, as shown in 
Table 5. In the majority of significant tests 
the use of extreme judgments was associated 
with high confidence. This finding coincides 
with and supplements the association between 
divergence from modal judgment and confi- 
dence. Confidence seems in most clinicians to be 
prerequisite to making definitive or extreme 
value judgments of clinical material. A chi 
square of 9.632 (p<.01) indicates a signifi- 
cant tendency for judges who are most confi- 
dent when making extreme judgments also to 
be more confident when making unpopular 
judgments. There is in addition a significant 


group 


likelihood that judges who are more confident 
when making extreme judgments will have 
high general confidence (chi 


11.542, p<.001). 


(iii) The last problem to be considered in 


levels square 


the area of confidence was the relationship 


between confidence and estimated clinical 
value of the items. Of 25 chi squares com- 
puted, 17 were significant at the .05 level or 
beyond (Table 5). All of these demonstrated 
a positive relationship between confidence and 
value judgment and none showed a negative 
The judges felt 


certain when they saw value in an item, but 


relationship. majority of 
expressed a lack of confidence when they saw 
little value in the item. These judges felt less 
confident in rejecting than in accepting items 
as useful. Thus it seems that high confidence 
in this kind of clinical judgment is associated 
with positive evaluations. 


Summary and Conclusions 


A group of clinical psychologists rated the 
100 items of a sentence completion test (blank) 
in terms of degree of potential clinical value. 
They also indicated the degree of confidence 
which they felt in each judgment. These judg- 
ments concerning the clinical value of items 
were then studied in relation to (a) the form 
of the item (first or third person), (4) the 
area of content, and (c) the degree of certainty 
or confidence expressed in the rating. The 
following conclusions appear to be warranted 
by the data: 

1. On a blank sentence completion test, 
the form of the item appears to have no in- 
fluence on judgments of its value. Within the 
same area of content, no significant differences 
in value judgments were found between first- 
and third-person items. 


2. Distributions of value judgments differed 
significantly in terms of area of content of the 
item. Items dealing with causes of feelings and 
with reactions to interpersonal relationships 
were judged to be more clinically valuable than 
items dealing with attitudes toward various 
interpersonal figures. 

3. Confidence ratings were found to be un- 
related to objective features of the test materi- 
al, except that increased experience in the 
rating procedure produced a significant increase 
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in confidence in one’s judgments. 

4. The differential confidence ratings which 
individual clinicians made with respect to their 
own clinicial judgments appeared to be consis- 
tent, systematic, personal characteristics. 

5. There appears to be a relationship be- 
tween a clinician’s confidence in his judgment 
and the similarity of this judgment to group 
consensus. However, most clinicians feel more 
confident when in disagreement rather than in 
agreement with the group. This fact is especi- 
ally true of clinicians whose general confidence 
levels are high. 

6. Confidence is associated more with ex- 
treme than with moderate judgments, especial- 
ly again in those clinicians whose confidence 
levels are high. 

7. High confidence appears to be associated 
with positive evaluations of clinical usefulness 
of items rather than with negative judgments. 


Received March 31, 1952. 
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The Sentence-Completion Technique in a Study 
of Scholastic Underachievement ’ 


Barbara Kimball 


Youth Guidance Center 


Worcester, Massachusetts 


While the sentence-completion technique 
has been widely used as a clinical tool, its po- 
tentialities as a research instrument have not 
been explored as fully as they deserve. To 
date, the literature on sentence completions 
has been concerned chiefly with their use as an 
aid in the clinical diagnosis of individual 
cases [4, 10, 13,] or as a screening device 
[6, 12, 14]. Some attempts have been made to 
validate the test against objective criteria [2, 
5], and there has been some interest in the 
analysis of its formal characteristics [7, 11]. 
Sacks and Levy [8], in reviewing the litera- 
ture, found the technique useful both as a 
screening device for therapy and as a source of 
clinical material. While many issues remain 
unresolved, most clinicians agree that the 
sentence-completion test is a useful diagnostic 
instrument. The purpose of this paper is to 
present an additional way in which it may be 
valuable—namely, as a method for testing hy- 
potheses which arise in the investigation of 
problems in personality dynamics. 

Clinical psychologists are presented with a 
dificult problem when they attempt to investi- 
gate the layers of the personality which the 
subject is either unwilling or unable to reveal. 
Psychoanalysis and the use of projective tech- 
niques have seemed to offer the best solutions. 
The former is usually impractical as a research 
procedure because of its length. The two pro- 
jective techniques which have been used most 
widely, the Rorschach and the Thematic Ap- 
perception Test, are subject to certain practical 
limitations. The very nature of projective 
material almost precludes quantification since 
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in the process many important qualitative fea- 
tures must be lost. Several investigators, 
among them Munroe [3] and Schafer [9], 
have reached the conclusion that projective 
techniques lose more in the process of quantifi- 
cation than they gain in the establishment of 
reliability and validity. There has been a tend- 
ency to treat the issue as an either-or proposi- 
tion: either a projective test is used as a clini- 
cal instrument with all of its qualitative value, 
or its quantification is attempted. 

It does not seem, however, that to use the 
qualitative aspects of the tests 
necessarily precludes a subsequent quantitative 
analysis. A recent study of educational malad- 
justment, conducted at a private preparatory 
school for boys, revealed the potentialities of 
the sentence-completion technique as an easily 
quantified adjunct to more detailed case ma- 
terial. Extensive case studies including psy- 
chiatric interviews, Rorschach tests, ‘Thematic 
Apperception Tests, intelligence tests, teachers’ 
reports, parents’ reports, and medical histories 
were obtained for a limited number of cases. 
As a result of these studies, certain hypotheses 
were 


projective 


developed concerning the personality 
dynamics underlying educational maladjust- 
ment. Conclusions from clinical studies often 
may be erroneous because of faulty observation, 
inadequate sampling, or the personal bias of 
the interpreter. While the flexibility of clinical 
thinking may create excellent hypotheses, such 
hypotheses should ultimately be tested by a 
controlled study. In this context, the one tech- 
nique which stood out as invaluable was the 
sentence-completion test. 


Problems and Hypotheses 


The problem under investigation was the 
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relationship between personality factors and 
scholastic achievement. I'wenty subjects were 
selected for intensive study, all of whom had 
a high level of intelligence as measured by 
both individual and group intelligence tests, 
and all of whom were failing in their school 
work. The subjects were adolescent boys in 
residence at a preparatory school. Despite the 
many variations in both external and personal- 
ity factors among these boys, there were cer- 
tain important features which appeared more 
frequently in their case material than would 
have been expected. 

Two of the major features of personality 
which seemed to predispose to scholastic disa- 
bility will be described briefly. The first and 
perhaps most important feature which ap- 
peared consistently in the case material was 
evidence of a poor father-son relationship. ‘The 
specific nature of this relationship varied from 
case to case, but never was there a warm, close 
contact with the father. The projective tests 
frequently revealed much aggression toward 
father figures. In interviews, the reports tend- 
ed to be either of fathers who worked long, 
hard hours and so had a distant relationship 
with their sons, or of fathers who were strict 
disciplinarians and wished to dominate them. 


Aggression was another focal point of con- 
flict. In practically all of the cases there were 
indications of considerable underlying aggress- 
ion, although only rarely did aggressive feel- 
ings reach the level of overt expression. In the 
projective material two extremes were repre- 
sented : either a large amount of aggression was 
present, or else there was only a very indirect 
expression of negative feeling. Behaviorally, 
there was a pattern of early violent expressions 
of hostility in the form of temper tantrums or 
continual fighting, so there is evidence that 
such feelings were intense at an early age. At 
a later age, these boys now describe themselves 
as never really becoming angry or as never 
openly showing their displeasure. The picture, 
then, is one of strong aggression with little di- 
rect outlet for hostile feelings. 


Other personality characteristics might be 
described, but for illustrating the use of the 
sentence-completion technique to test a prop- 
osition, the following two hypotheses have been 
chosen. 
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a. A significantly higher number of the 
underachievers will reveal an essentially nega- 
tive relationship with the father than will be 
found in the total population. 

b. Aggressive feelings will be a source of 
guilt and anxiety more frequently among the 
underachievers than in the total population, 
and the underachievers will be less able to 
give direct, effective expression to these nega- 
tive feelings than will the members of the 
randomly selected group from the total pop 
ulation. 


Testing Procedure 


One advantage of the sentence-completion 
technique in research is that the test can be 
structured to obtain material relevant to the 
exact questions raised. In this instance a num- 
ber of sent=nces were included to elicit infor- 
mation bearing directly on the two hypotheses. 
A fifty-item form was constructed, some sen- 
tences being taken from earlier forms and some 
being original. All sentences had a specific fo- 
cus, with exception of a few shock absorbers. 
The sentences were phrased in the third per- 
son, at times using the personal pronoun, at 
times using boys’ first names. It was felt that 
the use of the third person alone would tap un- 
conscious attitudes more easily, and also would 
avoid the confusion which arises when there 
is a mixture of first- and third-person items in 
one test. Since the test was constructed, how- 
ever, Sacks [7] has described the differences 
obtained between projective responses to stim- 
uli referring to the self and to other persons. 
Although his general conclusions favored the 
first-person form, he pointed out that the 
third-person form elicited a larger proportion 
of responses indicating negative attitudes to- 
ward father and superiors. As this appears to 
be an important area of hostility for the ado- 
lescent boy, it may be fortunate that all of our 
sentences were framed in the third person. 
The use of the third person is also in accord 
with the comparison which Combs [1] made 
of autobiographies and TAT records, in which 
he found that negative or unacceptable feelings 
were expressed more easily in the less personal 
projective test. An attempt was also made to 
phrase the sentences so that they presented as 
concrete a situation as possible; it is the pre- 
vious experience of the author that concrete 
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items elicit more meaningful responses from 
adolescents than do more general, abstract 
items. 

The test was administered to all students 
during their English classes. The teachers gave 
the test, and did not indicate that it had origi- 
nated in the psychology department. The 
directions used were: 


Please complete the following sentences, Do them 
in order and do not leave any of them out. Work 
as fast as you can because speed is important in 
this test. Be sure that you make a complete sentence 
out of each one. You may finish them in any way 
that you wish as long as you express a complete 
thought. You will have twenty minutes, and you 
should be able to finish them all in this time. 


It was suggested to the teachers that they ad- 
minister the sentence completions much as 
they would a vocabulary test. Each student 
was asked to turn in his test as soon as he was 
finished. Speed was emphasized so that the boys 
would tend to give the first idea which 
occurred to them, without conscious censor- 
ship. It seems probable that this method of ad- 
ministration yields more significant material 
than when the subject is aware of the purpose 
of the test or has time in which to disguise his 
initial reactions. 


Testing the Hypothesis of Father Relationship 


The following procedure was used to test 
the two hypotheses. Seventeen case-study sub- 
jects had taken the test. Their responses were 
compared with those of a control group of 100 
subjects, randomly selected from the total 
school population. 


Two sentences had been included for the 
specific purpose of eliciting attitudes toward 
the father. The first of these ( His father ) 
was almost completely unstructured and drew 
a variety of responses which were classified as 
positive, negative, or neutral. The second 
sentence (When he saw his father coming, 
he ) usually elicited either an avoidance 
or an affiliative response, although it was 
sometimes impossible to classify the response 
and a neutral category was used. Other 
sentences were found in which the subject 
spontaneously introduced the father figure 
either in a positive or a negative context. Such 
items were included in the analysis. 








Specific criteria were adopted for the classi- 
fication of responses. In general, responses 
which indicated friction with, aggression to- 
ward, or dislike of the father were considered 
as evidence for a negative relationship. ‘hose 
which indicated confidence in, identification 
with, or admiration of the father were con- 
sidered as evidence for a positive relationship. 
In particular, the following criteria were 
used; examples from protocols are given in 
each case. 


Responses which indicate a negative father rela 
tionship. (a) Derogatory statements about the fath 
er: His father was a drunkard who could not sup 
port his family. (+) Reference to the death or in- 
jury of the father: Peter burst into tears when he 
heard about his father’s death. (c) Attribution of 
a very low-level occupation to the father, inter- 
preted as a form of devaluation: His father is a 
janitor (when in reality his father is a college 
professor). (d) Avoidance of the father: When 
he saw his father coming, he tried to hide. (¢) 
The father is seen as a restrictive agent: Jeff can't 
do what he wants to because his father won't let 
him. (f)The father is seen as a punishing agent 
Steve ran away because his father spanked him 
(g) Disruptive emotional reaction to the father 
When he saw his father coming, he burst into 
tears. (h) Reference to the father as aged: His 
father is seventy-five years old. These items point 
to a lack of companionship with the father 

Responses which indicate a positive father rela 
lationship. (a) Indications of a high evaluation 
of the father: His father is a very important man 
(b) The tendency to consult the father for advice 
and guidance: Whenever Jack gets into trouble, he 
goes to his father. (c) The expression of a wish 
to be with the father: When he saw his father 
coming, he ran toward him. (d) Complimentary re- 
marks about the father: His father is a very nice 
man. 

Sentences which indicate a neutral father rela- 
tionship. (a) Mention of an occupation with no 
descriptive adjectives except at a very low level, 
in which case the score is negative: His father 
is a doctor. (b) Age references with the exception 
of referring to the father as elderly: His father is 
forty years old. (c) Ambiguous responses: When he 
saw his father coming, he ran. (d) Unclassifiable 
responses: When Frank was at home, his father 
arrived from overseas. 


The first procedure in testing the first hy- 
pothesis was to classify all responses referring 
to the father made by the seventeen case-study 
subjects and by the control group of 100. Each 
subject was given an over-all rating of positive, 
negative, or neutral, based upon the total num- 
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ber of responses involving the father relation- 
ship. For example, a boy with three positive 
responses and one negative response was given 
an over-all classification of positive. The per- 
centage of boys who revealed primarily nega- 
tive relationships was then obtained and the 
significance of the difference between the per- 
centages of the two groups was tested. Accord- 
ing to the first judge (the author), 59 per 
cent of the underachievers had a _ predomi- 
nantly negative rating as compared to 29 per 
cent of the control group. The critical ratio 
for this difference is 2.4 which is between the 
OS and .01 levels of significance. Since some 
bias might have operated in making the ratings, 
a second judge repeated the procedure. In 90 
per cent of the unselected cases there was com- 
plete agreement as to the over-all rating for 
each boy. There was 100 per cent agreement 
on the over-all ratings of the 17 members of 
the case-study group, and 96 per cent agree- 
ment on the single items. The total agreement 
in the two judges’ ratings of individual items 
was 95 per cent. 

The results indicate that a_ significantly 
higher number of underachievers revealed an 
essentially negative relationship with their 
fathers, as measured by the sentence-completion 
technique, than did the unselected students. 
The finding furnishes statistical confirmation 
of the first hypothesis. 


Testing the Hypothesis of Aggression 
and Guilt 


The second hypothesis was tested by a study 
and classification of two sentence-completion 
test items which consistently gave information 
regarding sources of guilt and anxiety. The 
first of these was, “George was sorry after 
he ,»” and the second, “Roger would have 
done anything to forget the time he ye 
Because the main interest was in the aggression 
category, the aggression responses were sub- 
divided into three types: (a) Physical aggres- 
sion toward inanimate objects: George was 
sorry after he broke the lamp. (4) Physical 
aggression toward animate objects: George 
was sorry after he had strangled the woman. 
(c) Verbal aggression: Roger would have 
done anything to forget the time he swore at 
his grandmother. 

The expression of negative feelings by 
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means of displacement upon inanimate objects 
was considered to be less anxiety producing 
than physical aggression toward living beings, 
and possibly not even as anxiety producing as 
verbal aggression, since the former may repre- 
sent a more acceptable type of adjustment. 
Therefore, the hypothesis tested may be stated 
more explicitly as follows: The introduction of 
aggression toward living things is found as a 
source of guilt or anxiety more frequently 
among underachievers than in the total pop- 
ulation. 

Forty-seven per cent of the 17 case-study 
subjects introduced aggression, including verb- 
al aggression, toward animate objects in both 
sentences, in comparison to 13 per cent of the 
100 unselected subjects. The difference be- 
tween these percentages yields a critical ratio 
of 3.37 which is significant beyond the .01 
level. When verbal aggression was excluded 
and only the expression of physical aggression 
toward living things considered, the 
for the 
case-study group and the randomly selected 
group was again significant beyond the .01 
level, the critical ratio being 3.30. To establish 
the reliability of these ratings, a second judge 


was 


difference between the percentages 


repeated the classification procedure. Of a 
total of 234 items, in only 7 was there dis- 
agreement. 

Since there is evidence that physical aggres- 
sion is more frequently a source of guilt and 
anxiety among underachievers, it might be 
predicted that they will have difficulty in ex- 
pressing negative feelings directly. Three items 
of the sentence-completion test were specifi- 
cally designed to see how, if at all, aggression 
would be expressed. The sentences were as 
follows: “When Jack really became angry 
he ”. “Tf anyone should stand in Tom’s 
way, he. would——”; “If anyone bothers 
Carl, he .’ The first item presents the 
subject with the situation of feeling aggres- 
sive or hostile and attempts to evoke his 
reaction to such a feeling. The other two 
present situations which would ordinarily call 
for an aggressive reaction. 





It was found that the responses could be cate- 
gorized into seven groups: (a) Aggressive action 


(not necessarily physical) directed toward removal 
of the irritant: If anyone should stand in Tom’s 
way, he would kill them; If anyone bothers Carl, 


we 
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he tells them to go to hell. (b) Emotional response 
without action: If anyone bothers Carl, he becomes 
angry. (c) Physiological response: When Jack real- 
ly became angry he turned purple. (d) Avoidance 
of issue or withdrawal: If anyone bothers Carl, he 
tries to ignore that person. (¢) Loss of control, or 
action that is not directed toward the irritant: 
When Jack really became angry, he went to pieces; 
when Jack really became angry, he ripped off all 
his buttons. (/) Diffuse verbal reaction: When Jack 
really became angry, he cursed. (g) Dependent re- 
action: If anyone should stand in Tom’s way, he 
would tell his father. A few unclassifiable responses 
were put in a miscellaneous category. 


Among the 17 case-study subjects, seven 
(41 per cent) never gave a response which 
could be classified in the first category, namely, 
aggressive action directed toward the removal 
of the irritant. In the unselected group, only 
13 (13 per cent) never gave a direct, effective 
expression of aggression. The critical ratio for 
the difference between these percentages is 
2.83 which is beyond the .01 level of signifi- 
cance. Judge B classified the responses into 
only two categories, either as aggressive action 
directed toward the removal of the irritant or 
as some other unspecified reaction. There was 
99 per cent agreement between the two judges 
in their classification of responses. 


Discussion 

On the basis of this study certain advan- 
tages as well as several specific limitations in 
the use of the sentence-completion technique 
as a research tool are evident. The easy adapta- 
bility of the test which permits elicitation of 
material relevant to some particular problem 
area is its most outstanding virtue. The possi- 
bility of restructuring the content of the sen- 
tences without changing the basic nature of the 
test itself certainly makes it a flexible instru- 
ment in investigation. Because its adminis- 
tration procedure is short and simple, the test 
is applicable to the study of large groups, and 
allows the collection of large quantities of ma- 
terial with a minimum of effort. In addition, 
the classification of its responses is relatively 
simple if the categories are clearly defined. In 
this study there was a constant effort to make 
the criteria for classification very specific, 
which no doubt affected its high scoring reli- 
ability. 

The final scoring procedure used here is 


ws 
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open to question, particularly in relation to the 
summation of positive and negative scores. 
This summation was intended only as a first 
approximation of quantification; in further 
experiments a more refined method could 
doubtless be developed. Various items were 
equated in terms of their meaningfulness with- 
out any explicitly valid basis for doing so. For 
example, perhaps items in which a subject in- 
troduced the father figure spontaneously might 
carry different weight in terms of attitude to- 
ward the father than items in which the test 
itself introduced the father figure. ‘Taking an 
over-all positive, negative, or neutral score for 
each subject also introduced a source of error: 
a boy who always expressed neutral attitudes 
toward the father probably has a different re- 
lationship pattern than one who gave an equal 
number of positive and negative responses, yet 
both received the same over-all rating. Many 
other questions of procedure can be raised, but 
it seems reasonable to leave the degree of re- 
finement of scoring to the individual experi- 
menter. It should be determined primarily by 
the kind of material involved and by the type 
of hypothesis under investigation. In _ the 
present experiment, the scoring procedure was 
simple and reliable, and answered the ques- 
tions asked despite its limitations for giving 
more detailed information. 

Difficulties arise much more readily in de- 
ciding how the responses should be classified, 
and in determining what categories are psy- 
chologically meaningful. It has already been 
suggested that response classification should 
follow more complete clinical studies so that 
one can proceed, even if indirectly, from a com- 
prehensive clinical description rather than from 
a more elementaristic approach. It follows that 
the sentence-completion technique will be help- 
ful only if the broad studies are thoughtfully 
done. Even when this precaution has been 
taken, the problem of the meaningful categori- 
zation of responses will not be mechanical but 
will depend upon the clinician’s experience, 
insight, and knowledge of personality dynam- 
ics. 

The question of the validity of sentence- 
completion material cannot be solved as easily 
as that of its reliability. On the Sentence 
Completion test one group of subjects may ex- 
press a significantly greater amount of aggres- 
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sion than another comparable group, but the 
behavioral counterpart of the responses is not 
self-evident. It is important not to go beyond 
the data at hand. Complete preliminary studies 
of a small number of subjects are essential, 
and will help solve the problem of establishing 
validity. If valid conclusions can be reached 
through the Sentence Completion test in a 
small number of cases, it may be used with 
larger groups with some assurance that the 
data will not be distorted. This preliminary 
step is necessary if the less complete data from 
larger groups are to be accepted as being 
meaningful in terms of personality dynamics. 

A further limitation of the sentence-comple- 
tion technique is that it presupposes a minimum 
level of education. It cannot be used with 
very young children unless administered orally 
and therefore individually, neither would it be 
helpful in work with illiterate people or with 
those unwilling or unable to cooperate in a 
group-testing situation. However, it is appli- 
cable to groups of school children above the 


third or fourth grade and to most American 
adults. 


Summary 


The sentence-completion technique has been 
described and to some extent evaluated as a 
research tool. An experiment wherein this tech- 
nique was found to be useful is described, and 
certain of its advantages and limitations dis- 
cussed. The study suggests that the sentence- 
completion technique is helpful for testing hy- 
potheses raised as a result of more complete 
personality studies. 


In the particular experiment, two hypotheses 
were examined, using the Sentence Completion 
test, with the following results: 


1. A significantly higher number of under- 
achievers revealed an essentially negative re- 
lationship with their fathers than did a control 
population. 

2. Aggressive feelings were a source of 
guilt and anxiety more frequently among the 
underachievers than in the total population, 
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and the underachievers were less able to give 
direct, effective expression to their negative 
feelings. 
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Rorschach Movement Responses Following Motor 
Inhibition and Hyperactivity” 
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Franklin D. Roosevelt V. A. Hospital, Montrose, N. Y.* 


Julian Meltzoff 


Philadelphia V. A. Mental Hygiene Clinic 


and George D. Goldman 


College of the City of New York 


This investigation is one of a series designed 
to relate Rorschach’s human movement re- 
sponse to a more general conceptual frame- 
work in psychological theory. Strains of evi- 
dence and speculation from diverse sources [5, 
7, 8, 11] have suggested an important relation- 
ship between the capacity to inhibit overt 
motor activity and the development of fantasy, 
anticipatory planning, and mature thought 
processes in general. Rorschach [9] on the 
basis of his observations indicated that an in- 
verse relationship existed between the tend- 
ency to give kinesthetic responses to the ink- 
blot stimuli and the degree of general motility 
characteristically manifested by subjects. Since 
Rorschach also considered the pervasive tend- 
ency to perceive humans in motion on the blots 
to be an indication of the capacity for rich 
fantasy living, he implied, albeit without a 
theoretical formulation, a triadic relationship 
of motor inhibition, motion perception, and 
imagination. 


1A portion of this paper was read at the meet- 
ings of the Eastern Psychological Association, At- 
lantic City, 1952. 

*The authors are indebted to Veterans Adminis- 
tration Clinical Psychologist Trainees Murray Le- 
vine, Leo Katz, Aaron Leve, Bernard White, Frank 
Wexler, Mark Grunes, and Don Tomblen who as- 
sisted in the collection of preliminary and experi- 
mental data. 


*Reviewed in the Veterans Administration and 
published with the approval of the Chief Medical 
Director. The statements and conclusions published 
by the authors are the result of their own study 
and do not necessarily reflect the opinion or policy 
of the Veterans Administration. 


Recently, attempts have been made to de- 
velop general theories from which relation- 
ships such as Rorschach’s might be deduced. 
Rapaport’s [8] psychoanalytic theory of think- 
ing specifies the inhibition of action directed to- 
ward immediate need satisfaction as a basic 
condition for the development of fantasy. 
Weizsaecker [10] considers motion and motion 
perception as two ways of transforming avail- 
able energy and adduces physiological and per- 
ceptual data in support of his special view of 
“introversion” and Werner 
[11] maintains similarly that tonic energy is 
the dynamic property common to both sensory 
and motor activity and postulates a vicarious 
relationship between the two. In support of his 
views, Werner [11] has presented data indi 
cating that the hyperactive exogenous feeble- 
minded have higher thresholds for stroboscopic 
and illusory motion perception than do the 
more phlegmatic endogenous type. Similarly, 
on the Rorschach test these hyperactive child- 
ren produced significantly fewer movement re- 
sponses of all types than did the less active 
group. 


“extroversion.” 


In an attempt to test Werner’s sensory-tonic 
theory more directly in its relation to the 
Rorschach test, Meltzoff, Singer, and Korchin 
[6] obtained responses from subjects before 
and after a period of motor inhibition. It was 
hypothesized that M productivity prior to in- 
hibition would be positively correlated with a 
measure of duration of inhibition. It was 
similarly predicted that following a period of 
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motor inhibition Experimental subjects would 
show significantly more M than Control sub- 
jects who had not undergone motor inhibition. 
Results in general supported the hypotheses 
but raised additional problems meriting further 
inquiry. 

The present experiment represents a further 
effort at testing hypotheses regarding the an- 
tagonism of motor behavior and perceptual 
responses. The inhibition task in the study of 
Meltzoff, Singer, and Korchin [6], writing 
a brief phrase as slowly as possible, was a local- 
ized one restricted to the writing hand and 
arm. The present investigation was designed 
to test the effect of a more generalized con- 
dition of motor inhibition on the subsequent 
perception of movement in the Rorschach. In 
addition, a contrasting condition of motor hy- 
peractivity was included to observe its in- 
fluence on subsequent production of Rorschach 
movement responses. 


The Experiment 
Hy potheses 
1. Subjects who undergo a period of volun- 
tary inhibition of motor activity should subse- 
quently produce an increased number of Ror- 
schach human movement (J) responses com- 
pared with a prior control administration. 


2. A greater number of MM _ responses 
should be produced following motor inhibition 


than following hyperactivity. 


3. Following hyperactivity, subjects should 
produce fewer M than in a control adminis- 
tration. 


The above hypotheses have been formulated 
for the M response as originally conceived of 
by Rorschach. Previous research [2, 6, 11] has 
yielded inconsistent results with respect to the 
other types of movement response, Klopfer and 
Kelley’s [4] FM and m (animal and inani- 


mate movement). 


Procedure 


The Ss were 24 male university students the 
majority of whom were undergraduates em- 
ployed as hospital aides during the summer 
session. All Ss were ignorant of the purpose 
of the experiment and none had any significant 
degree of acquaintance with the Rorschach 


blots. 
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All Ss were administered a pair of Ror- 
schach cards with standard instructions on 
three occasions. There was an initial control 
administration followed immediately by an 
intervening task, either Motor Inhibition or 
Hyperactivity. Immediately following this task 
the second pair of cards was administered. 
After a brief “recovery” period following the 
second administration, the alternate interven- 
ing task, Motor Inhibition or Hyperactivity, 
was presented. This task was followed imme- 
diately upon its expiration by the final admin- 
istration of two Rorschach cards. The sequence 
was, therefore, a control administration initial- 
ly for all Ss with half undergoing Inhibition 
prior to Hyperactivity and the remaining Ss 
exposed to the experimental conditions in re- 
verse order. The particular order assigned a 
subject was based on prior randomization of 
conditions. 

The pairs of Rorschach cards employed were 
II and IV, I and III, and VII and VIII. Each 
pair included a chromatic and achromatic card 
selected from cards thought most likely to yield 
a sufficient range of movement responses. The 
order of pairs was systematically varied in 
such a manner as to insure that each set ap- 
peared in each order an equal number of times, 
thus permitting some control over the stimu- 
lus value of the blots themselves. Instructions 
for the Rorschach administration were stand- 
ard with an inquiry following. The Ror- 
schachs were scored independently by two 
clinicians following in general the method of 
Klopfer and Kelley [4]. 

In the Motor Inhibition condition, sub- 
jects were asked to walk toward E from a 
distance of several yards. At a signal, they 
were instructed to “freeze in place” and to 
maintain a rather awkward posture without 
moving for a five-minute period. At regular 
intervals E insisted that § maintain this posture 
without any movement at all. This proved to 
be a difficult task, but the Ss did their best to 
avoid any movement. At the end of the time 
period § was immediately seated and handed 
the appropriate Rorschach card. Following 
the Rorschach administration five minutes 
were permitted as a respite before the next in- 
tervening task began. 

The Hyperactivity condition consisted of a 
five-minute period of vigorous calisthenics. The 
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S was instructed to stand in the center of the 
room and to begin the exercise at a signal. 
Two minutes of rapid straddle-jumping, two 
minutes of touching toes, and one minute of 
deep-knee bends were required. These proved 
to be extremely taxing even for college stu- 
dents serving as mental hospital aides. No res- 
pite was permitted during this period, and as 
soon as it was ended S was seated and the next 
Rorschach cards were administered. Upon 
completion of the total experimental session, S 
was interviewed for his introspections and re- 
actions to the experimental procedures. 


Table 1 
Rorschach Response Totals under Three 
Experimental Conditions 











(N = 24) 
Conditions is : Determinants 
R M FM m Color Shading 
Control 202 30 ~=«=21 2 13 28 
Inhibition 221 49 40 5 14 19 


Hyperactivity 212 30 31 8 16 33 








Results 


Table 1 presents totals for major determi- 
nants of the Rorschach produced by the Ss in 
each of the three conditions of the experiment. 
Color and shading have been combined into 
broader categories because of the small number 
occurring when finer breakdowns into FC, CF, 
C, or Fc, C’, k, and K are attempted. Signifi- 
cance of the differences between the data for 
each pair of experimental conditions was deter- 
mined by the use of the linked-pair ¢ test. In 
the case of the movement response the value of 
p/2 was employed since the hypotheses called 
for specific directional differences. 

As inspection of Table 1 reveals, differ- 
ences between the various conditions in the 
number of responses (R) produced by the Ss 
are comparatively slight. No significant differ- 
ence in R emerged between any pair of con- 
ditions. It does not appear that Inhibition or 
Hyperactivity have markedly influenced the 
sheer output of the subjects in Rorschach per- 
formance. This absence of any marked differ- 
ences in R is important because it enhances the 
possibility that any changes in determinants or 
other categories are specific and not merely a 
function of a general alteration in responsive- 
ness. The dependence of all categories on R 


has tended to cloud the significance of other 
experiments on the meaning of Rorschach de- 
terminants [1, 3]. 

With respect to the major experimental hy- 
potheses, inspection of Table 1 reveals a de- 
cided increase in M following Inhibition com- 
pared with production of this determinant in 
the control administration or following Hy- 
peractivity. This difference is significant at 
p/2<.05 for Control vs. Inhibition and Hy- 
peractivity vs. Inhibition. Control and Hyper- 
activity do not differ in production of M. 
Tests of the significance of differences in M as 
a proportion of R for the three conditions do 
not alter these findings. It is apparent from 
Table 1 that the increased number of re- 
sponses following Inhibition is made up by an 
equivalent increase in M alone. These data, 
therefore, do not permit us to reject the first 
two hypotheses and add further confirmation 
to the theory and results obtained by Meltzoff, 
Singer, and Korchin [6]. 

With respect to the third hypothesis pre- 
dicting a decline in M following Hyperactivi- 
ty, the data indicate no difference between the 
Control and Hyperactivity conditions. Despite 
a slight difference in the predicted direction 
when M is taken as a proportion of R, this 
difference is not great enough to reject the null 
hypothesis. It seems clear that despite the ob- 
vious physical exertion and subsequent fatigue 
suffered by Ss in the Hyperactivity condition 
the decline in M anticipated did not occur. 
Possible alternatives explaining these results 
will be discussed below. 

Moving beyond the specific hypotheses of 
this study, the data reveal other relationships 
that merit consideration. For reasons cited 
previously, the M response as originally post- 
ulated by Rorschach was separated from the 
other movement percepts (FM and m). It is 
apparent, however, that following Inhibition 
the increase in the nonhuman movement deter- 
minants compared with the Control results is 
as great as for M. The differences between 
Control and Inhibition for both FM plus m 
are also statistically significant (p<.05). 
There is no significant difference, however, be- 
tween Inhibition and Hyperactivity in either 
FM or FM plus m. The difference in FM be- 
tween Hyperactivity and Control is not sig- 
nificant. When m responses are added to FM, 
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the difference between Hyperactivity and Con- 
trol is significant at p<.05. 

The content of the various types of move 
ment responses was analyzed to determine 
whether any further evidence would emerge 
with relevance for the hypotheses. The M, 
FM, and m responses were rated separately by 
independent raters for Active vs. Static motion 
and for Aggressive vs. Passive content. ‘There 
was an increased number of Static M (“Man 
standing” as against “Man running’’) fol- 
lowing Hyperactivity compared with the other 
conditions. This result tends to support the 
third hypothesis. Nevertheless, the results of 
this analysis add little as the various conditions 
of the experiment otherwise yielded relatively 
similar proportions of the Active-Static and 
Aggressive-Passive types of content for the 
three kinds of movement responses. 

While no differences are apparent in the 
color response, the decrease in shading re- 
sponses for the Inhibition condition in con- 
trast with Control and Hyperactivity ap- 
proaches significance (p >.05<.10). This re- 
sult parallels a similarly unanticipated finding 
reported by Goldman and Herman [2]. Finer 
analysis of the shading responses reveals differ- 
ing proportions of Fc, C’, and K for the three 
conditions, the most striking change being a 
sizable increase in K following Hyperactivity. 
These data are not statistically reliable, how- 
ever. 


Discussion 


The results of this investigation appear to 
lend further support to the hypothesis relating 
motor inhibition and Rorschach Human Move- 
ment responses as set forth by Werner [11] 
and Meltzoff, Singer, and Korchin [6]. De- 
spite the change in the experimental design 
and in the nature of the inhibition situation 
the results are quite similar to those obtained 
in the previous experiments. The phenomenon 
of the lowered threshold for movement per- 
ception following a period of self-restraint 
seems experimentally reproducible and merits 
further exploration of its parameters. The 
larger issue of the possible relationship of 
motor inhibition to the thinking process and to 
creative fantasy remains unresolved here. 
Nevertheless, the experimental support af- 
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forded by these studies to Rorschach’s obser- 
vational acumen in relating motor behavior 
and the human movement percept suggests 
that his other empirical association, M and 
creative fantasy, is worthy of extensive ex- 
ploration. 

A persistent problem in connection with the 
hypotheses is the and 


inanimate movement responses. The present 


role of the animal 
experiment suggested that these responses, al 
though lacking to some extent the empathic 
quality used by Rorschach as criteria for true 
M, nevertheless increase following Motor In- 
hibition. In other experiments [6, 2] FM and 
m when added to M in some cases enhanced 
the magnitude of the phenomenon. Yet the in- 
crease in movement responses following motor 
restraint is most striking for M. To the extent 
that the inhibition condition involves for some 


subject frus 


a concomitant discomfiture and 
tration, a certain amount of hostility might be 
generated. If FM is interpreted as a symbolic 
or immature expression of primitive drives, it 
may serve for some subjects as a medium for 
expressing their anger at the period of restraint 
enforced upon them. Lacking data or further 
clues, however, an explanation of this sort 
must remain highly speculative. 

Similar speculation seems called for at 
present in order to explain the finding of a 
trend toward a decrease in shading responses 
following Inhibition. The most parsimonious 
solution may be that each subject’s capacity or 
tendency for production of Rorschach responses 
is fairly limited. Following Motor Inhibition 
there is some increase in R, but scarcely in 
proportion to the increase in M, FM, and m. 
The subjects, therefore, may be acting within 
rather circumscribed response limits, and in- 
stead of producing more responses, tend to 
shift the emphasis in their percepts toward 
movement to the neglect of shading with its 
finer nuances. Whether an explanation in 
terms of the relation of shading to anxiety is 
applicable here is difficult to say. Certainly, the 
proportions of the various components of the 
shading determinant in the Control series 
maintained their relative positions following 
the Motor Inhibition condition. 


When we turn to the nature of the Hyper- 
activity condition, several knotty problems 
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present themselves, It seems fairly clear that 
the hypothesized influence of Hyperactivity, 
except for the trend suggested by the content 
analysis, is not supported by the data. Non- 
human movement responses, as a matter of 
fact, were observed to increase rather than de- 
crease. Before rejecting the third hypothesis in 
its general form, however, it may be well to 
reconsider the Hyperactivity situation and to 
try to understand what specific influences may 
have been involved for the subjects. A quali- 
tative difference in the effect of the experi- 
mental conditions deserves mention since it 
may well help in explaining some of the un- 
expected results. ‘The Inhibition task, despite 
its difficulty, evoked little comment or criticism 
from Ss. In sharp contrast, however, was the 
vivid reaction by most Ss to the Hyperactivity 
task. ‘The enforced physical activity without 
any respite in hot weather left them quite 
fatigued but also frustrated by their inability 
to retaliate with hostility. Some Ss revealed 
that they became increasingly embittered as 
time passed, Since, for these college students, 
the experimenters were figures in authority, 
they were put in somewhat of a conflict con- 
cerning expression of hostile feelings. For 
some Ss, the exercise condition evoked un- 
pleasant memories of enforced calisthenics dur- 
ing their military service under harsh sergeants 
or lieutenants. Hence they carried into the 
immediately presented Rorschach not only a 
sense of physical fatigue but frustration and 
suppressed anger. It may be therefore pre- 
sumed that following exercise psychological 
factors in addition to those along a Motor In- 
hibition-Hyperactivity continuum were oper- 
ative. 

The trend toward increased K might sug- 
gest an increased anxiety factor here. Similarly 
one might speculate that, although in general 
a decrease in M might be expected following 
Hyperactivity, in this case the decrease was 
counterbalanced by an increase as a result of 
a “psychic” inhibition of hostile impulses to- 
ward E. Further research designed to dis- 
tinguish between the effects of motor and “‘psy- 
chic” inhibition might help to clarify some of 
the unanticipated findings of the present study. 


Summary and Conclusions 


This experiment was one of a series de- 
signed to test certain hypotheses relating the 
Rorschach human movement response to 4 
more general psychological framework. Previ- 
ous research had suggested a relationship be- 
tween Rorschach’s M and motor inhibition 
with an increase in perception of movement 
responses following a period of motor in 
hibition. In the present study a more gener- 
alized inhibition task, “freezing’’ in place in an 
awkward posture, and a contrasting condition 
of hyperactivity (five minutes of vigorous cal- 
isthenics) were employed. On the basis of Ror- 
schach’s work and Werner's sensory-tonic 
field theory, it was hypothesized that, follow- 
ing Motor Inhibition, Ss would show an in- 
crease in M responses to Rorschach blots com- 
pared with a control administration. Similarly, 
it was predicted that Ss would show a corres- 
ponding decrease in M following Hyper- 
activity. 

The Ss were 24 male college students who 
received pairs of Rorschach cards in an initial 
Control administration and immediately fol- 
lowing intervening conditions of Motor Inhi- 
bition and Hyperactivity. The Rorschach card 
pairs were systematically varied throughout 
the three conditions and the sequence of the 
Hyperactivity and Motor Inhibition conditions 
was alternated. 

There was no difference between the Con- 
trol and Hyperactivity conditions in the num- 
ber of M produced. In support of the first hy- 
pothesis a significant increase in M appeared 
following Motor Inhibition. The Inhibition 
condition produced significantly more FM 
plus m (nonhuman movement responses) than 
Control but not significantly more than Hyper- 
activity. The second hypothesis, relating to the 
effects of Hyperactivity, was not supported 
despite the fact that content analysis provided 
some suggestion of a trend toward more static 
types of movement responses following Hyper- 
activity. Qualitative considerations suggested 
that the specific conditions of the Hyperactivity 
situation introduced psychological variables 
from continua other than the Inhibition-Hy- 
peractivity dimension. The support for the hy- 
pothesis relating Motor Inhibition and pro- 
duction of movement responses suggests that 
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this vicarious relationship of motor and sensory 
processes [11, 6] merits further investigation 
with the goal of ultimately uniting motor inhi- 
bition, motion perception, and the capacity for 
creative imagination within a common concep- 
tual framework. 
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A Pattern Analysis of Structural Characteristics 
of the Rorschach Test in Schizophrenia » 


]. Warren Thiesen 


Veterans 


The diagnosis and study of schizophrenia is 
an area in which the Rorschach method has 
found some of its major applications since the 
time of its introduction. In identifying the 
characteristic productions of schizophrenic 
patients, and in validating Hermann Ror- 
schach’s original findings in this area [9], 
most early investigators were concerned with 
identifying specific scoring categories which 
had differentiating power with reference to 
the disorder il, 8]. 

With increasing clinical application of the 
technique, attention has been increasingly 
focussed upon evaluation of each Rorschach 
record as an interrelated whole, and this pro- 
jective technique has gained wide recognition 
as one of the most penetrating psychological 
instruments for describing the structure of the 
whole personality. 

Yet in this area of personality patterning, 
in which the Rorschach method should have 
its most unique value, most progress has been 
made at a clinical-intuitive level, rather than 
at a systematic experimental level. That is, 
there has been very little experimental demon- 
stration of specific relationships between re- 
sponse patterns and personality syndromes. In 
clinical use of the technique, such complex re- 
lationships have generally been inferred from 
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accumulated clinical experience, or subjective 
frames of reference, supplemented by certain 
published norms relevant to single Rorschach 
variables [2, 3, 4, 5, 6, 7, 8, 10]. 

The present analysis is one part of a larger 
research concerned with the investigation of 
schizophrenia, approached by systematic com- 
parisons between clinically observed reaction 
patterns Rorschach test findings. The 
several objectives of the entire research are 


and 


described in the introductory paragraph of the 
first paper in this series [3]. The particular 
aim of this analysis is to determine the exis- 
tence of specific Rorschach response patterns 
associated with schizophrenia and to consider 
their differentiations 
among members of this clinical group in terms 


possible relevance to 
of level of integration and prognosis. 

It is hoped that the 
described will offer some preliminary leads to 
the important task of developing a body of ex- 
perimental data relevant to basic Rorschach 
Test patterns which are clinically meaningful 


investigation to be 


and valid. A more immediate and specific aim 
of this research has been to suggest characteris- 
tics of schizophrenia which have become appar- 
ent through analysis of the Rorschach produc- 
tions of one patient population and can be des- 
cribed for further study and evaluation on 
other populations. 

It is to be emphasized, however, that this 
approach should not be construed as carrying 
the implication that a strictly objective, statis- 
tical analysis can be sufficient for the full 
clinical interpretation of a Rorschach protocol. 
It is felt, rather, that the proper evaluation of 
the qualitative and individual characteristics of 
each Rorschach record can become more mean- 
ingful in relation to more exact knowledge of 
those structural syndromes which identify and 
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describe the group of which the individual is ciated with the various schizophrenic reactions 
a member. was undertaken through a pattern analysis.° 
The first step consisted of reducing the 
Subjects numerical values of each Rorschach variable to 
The subjects used in this analysis were a three categories of magnitude: low, medium, 
patient population of 60 schizophrenic adults, nd high. This was done by setting one sigma 
so diagnosed by the neuropsychiatric staff of distance above and below the mean of the 
Michael Reese Hospital, and a representative control group as the limits of the medium cate- 
control group of 157 gainfully employed ory, or establishing the nearest approximation 
adults. The controi group was the population © this criterion as category limits for those va- 
on which the 1949 revision of the Beck scoring riables whose distributions were too skewed for 
norms was based. The selection of this group the utilization of the standard deviation. In 
is described in a previous paper [3]. 3 of the 20 scoring categories, the extent of 
variability was so small that it was more prac- 
tical to utilize only two categories, low and 
The Rorschach tests of all subjects in both high. The category limits which were estab- 
grours were individually administered and lished in this manner, as a basis for the pattern 
scored according to the Beck system, utilizing analysis, are shown in Table 1. 
the revised norms [2]. Twenty of the major The next step consisted of the actual in- 
Rorschach scoring categories (those whose spection for the recurrence of particular com- 
scoring is relatively more objective and whose 
clinical significance has been more widely *Only those protocols containing 10 or more scor- 
; ; : able responses were used in the enalysis 
studied ) were selected as the variables for this hie selina Standen te Di Tons Debbi ten 
analysis and are shown in Table 1. An empir- vignioten is ‘applicability gel mre ag tore tos 
ical search for Rorschach test syndromes asso- method to these data. . 


Table 1 


Limits of Categories: Low, Medium and High, for Rorschach Variables on 
which Patterns Shown in Table 2 are Based 


Procedure 














Variable* Low Medium High 

R : ee ee ee ha ee eee 0-14 15-50 51 and Above 
P+h - i — dedeatinien 0 — 68.9 69.0 — 89.9 90.0 and Above 
= va ; aaa as 0-4 5-9 10 and Above 
a be PRIN. 5 ns eat gl 0 — 33.9 34.0 — 59.9 60.0 and Above 
See ae S aicaliobdnceasiine 0-7.5 8.0 — 37.0 37.5 and Above 
6) ee ; a a ae 0 — 56.4 56.5 — 83.8 83.9 and Above 
D%..... a eR ORE ae 0- 58.9 59.0 — 84.9 85.0 and Above 
a ee ae Lee ee 0-6.1 6.2 — 34.3 34.4 and Above 
Dda%...... — . re ee 0-0.9 1.0 -— 16.5 i6.6 and Above 
C (No. of, including blends in which form 

does not enter, such as C- Y) : 0 1 2 and Above 
Sum C (Total Color Weights) -............... 0-0.5 1.0 — 5.0 5.5 and Above 
Sum C less Sum FC (Weights) -........ 0 0.5 — 4.0 4.5 and Above 
No. of M Responses Ves ae ae 0 1-5 6 and Above 
No. of M-— Responses..................---------~ 0 1 2 and Above 
No. of S Responses CRMC w, WORE Sere 0 1-3 4 and Above 
EEL 0 — 1 and Above 
No. of Anatomy Responses 

(excluding Sex Responses) -......................--- 0 1-2 3 and Above 
BRIT IF I on ccsinidcreededicnsesindintmeniainiisinanait 0 —— 1 and Above 
re aii mistttinisnininicctcninininntnt 0 —_— 1 and Above 
eo ee 0 1 2 and Above 





*Indicated by standard abbreviations as employed in ‘“‘Rorschach’s Test,” Vol. I, by S. J. Beck [2]. 
Sum (F-++-, F—, F) 
tF% = 2 * 100 
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binations of categories, or patterns. Each sub- 
ject in each group was identified by a serial 
number, which made it possible to list all of 
the subjects characterized by any Rorschach- 
variable level under consideration. Duplicate 
sets of worksheets, each consisting of forty 
horizontal rows, were prepared. Each row 
represented one of the extremes (high or low) 
of each of the twenty Rorschach variables, and 
was identified by the appropriate descriptive 
label (e. g.. F+%: High). To the right of 
each descriptive label entries were made of 
serial numbers of all subjects characterized by 
that variable and level. These serial numbers 
were entered in two columns, one for controls 
and one for patients. As these work sheets were 
prepared in duplicate, it was possible to make a 
series of comparisons of all possible combina- 
tions of categories for recurrence of cases. 
This was done by a process of successive align- 
ment of each row of one worksheet under every 
row of the duplicate sheet. Combinations 
which appeared on inspection to be discrimi- 
nating (e. g., with serial numbers of patients 
predominating) were immediately recorded, 
with the appropriate serial numbers, on a new 
worksheet on which discriminating combina- 
tions were represented (e. g., F+%: Low; Z 

















Table 2 
Patterns Associated with Schizophrenia 
De- Frequency Chi | P 
Pat-|  scrip- Controls Patients Square 
tern tion (N=157) (N60) 
AlAn: High|| 1 | 12 | 25.5 |<.001 
Sex: High || (0.6%) | (20.0%) 
B| F+%: j a PE a 
Low 2 11 19.4 | <.001 
Z Score: 
Low (1.3%) (18.3%) 
C| FC+: | " 
Low 0 6 12.2 | <.001 
M: Low 
\4A%: Low (0.0%) (10.0%) 

D| F+% | cao 
Low 1 9 | 16.9 | <.001 
| P: Low 

1 A%: Low || (0.6%)| (15.0%) 

E| DW: | | 

| High 1 7 | 12.0) <.001 
FC+: 
Low (0.6%) (11.7%) | 
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Score: Low). All of the pairs which showed 
any tendency to be discriminating were then 
tested with every other variable for possible 
combination into more sharply differentiating 
triads. This technique can be characterized as 
essentially a manual method of getting infor- 
mation relative to pattern clustering which 
could have been secured mechanically by means 
of the IBM machine punch-card technique. 

The final task was the selection of the more 
outstanding combinations which could reason- 
ably be identified as patterns associated with 
schizophrenia. Such a selection was made by 
accepting only those patterns which met the 
following two criteria of discrimination: (a) 
an incidence of at least 10% in the patient 
group and less than 2% in the control group, 
and (4) statistical significance at the .01 level 
of confidence. 


Results 


From this analysis five patterns emerged 
which met the above criteria. They are shown 
in Table 2. 

It will be noted that the degree to which 
these patterns discriminate between the two 
groups, and the statistical significance of that 
discrimination, considerably exceed the original 
criteria. 


statistical signifi- 


cance, the assumption that the two groups were 


For determining 
derived from a homogeneous population with 
respect to pattern occurrence was tested by 
chi square, using Yates’s correction for con- 
tinuity. The assumption of homogeneity is ex- 
cluded at a criterion beyond the .001 
level of confidence in the case of each of the 
five patterns. 


well 


Table 3 
Frequency of Pattern Occurrence in Control 
and Patient Populations 
Group Number of Patterns Total 
None One or More 
Controls 152 5 157 
(96.8%) (3.2% 
Patients 31 29 60 
(48.4%) 


(51.6%) 
Chi Square: 63.6 
p< .001 


The number of individuals in each of the 
two groups who are characterized by (a) ab- 
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sence of patterns, and (4) presence of one or 
more patterns, is shown in Table 3. The extent 
to which these patterns are associated with 
schizophrenia and are rare in a nonpatient pop- 
ulation is reflected in the finding that one or 
more patterns characterizes 48.4% of the 
patients and only 3.2% of the controls. While 
this degree of association is not sufficient to per- 
mit accurate prediction from negative findings 
(e. g., absence of patterns), it is evident from 
the very high chi-square value (calculated with 
the correction for continuity) that the two 
groups are not drawn from a homogeneous 
population with respect to incidence of 
patterns. 


Table 4 
Frequency of Pattern Occurrence in Control and 
Patient Populations 
(Breakdown of i‘able 3) 


Grour Number of Patterns Present 


To- 

1 4 tal 

Controls 162 5 0 “— “0 167 
(96.8%) (3.2%) 

Patients aa et. ee 3 23—s_ 60 


(51.6%) (35.0%) (5.0%) (3.38%) (5.0%) 


Table 4 shows a more detailed analysis of 
the data in Table 3, indicating the number of 
individuals in each group characterized by spe- 
cific numbers of patterns. It is evident from 
this breakdown that the presence of more than 
one pattern in an individual is a finding which 
occurred only in patients. Although only 8 of 
the 60 patients in the sample were character- 
ized by the finding of more than one pattern, 
an analysis of the particular pattern combina- 
tions occurring among these 8 patients was 


Table 5 


Frequencies of Different Combinations of Patterns 
Among the Eight Patients Characterized by 











Pattern Combination Frequency 


(Number of Patients) 





ABCD 
ACD 
AD 
ADE 





oO me es et re re YY) 
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Total Combinations 
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made in an effort to throw further light on the 
nature of certain Rorschach syndromes asso- 
ciated with schizophrenia. Table 5 specifies the 
various pattern combinations occurring among 
the “two or more pattern” individuals. 

An additional finding, which is not pre- 
sented in tabular form, concerns the relation- 
ship between the patterns and productivity. 
The possibility that productivity differences 
might account for some of the discriminating 
capacity of the patterns would appear to be ex- 
cluded by the fact that the difference between 
the mean response totals of the two groups was 
small and statistically insignificant. As a second 
check, the patient and control groups were 
each divided into high and low productivity 
groups, using the productivity mean of the 
control group as the cutting point. Exami- 
nation of pattern occurrence in the subgroups 
showed that in each of the two productivity 
groups, presence of one or more patterns 
differentiated the patients from the controls in 
almost exactly the same proportions as were 
observed in the undivided groups. Productivity 
differences appear, therefore, to be noncontrib- 
utory to the discriminating power of these 
patterns. 


Discussion 

Detailed interpretation of the psychopatho- 
logical significance of these clusters is beyond 
the scope of this paper. It may be noted in 
passing that Pattern A is a combination of re- 
lated content categories, which is suggestive of 
somatic preoccupation and release from cul- 
turally determined inhibition. Pattern B is re- 
lated to intellectual factors and may reflect a 
concomitant change in reality testing and in- 
tellectual drive. The significance of the inter- 
relationship between the variables in Pattern 
C will require further study. The other triad, 
Pattern D, suggests various aspects involved 
in a generalized estrangement or reduction of 
effective contact with the environment. The 
combination in Pattern E is suggestive of an 
insufficiency of emotional rapport with the 
outer world and a concomitant disturbance 
of the thinking process. 

A word of caution is indicated regarding 
the interpretation of Pattern A. While it 
occurred in only one of the 157 normal adults, 
it is quite conceivable that in selected popula- 
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tions representing certain specialized interests, 
and particular kinds of sophistication, this clus- 
ter might be considerably more common in the 
absence of psychopathology. 

An over-all view of these patterns is in or- 
der. In the event that their discriminating 
power is confirmed, what would be the practi- 
cal implication of the discovery of one or more 
of these patterns in an individual patient? The 
analysis shown in Tables 3 and 4 may throw 
some light on this question. It is evident from 
Table 3 that almost half of the patient group 
is characterized by the presence of one or 
more of the five patterns, and that these 
patterns appear in only about 3% of the con- 
trol group. A follow-up of these 60 patients, 
now in progress, may help to explain the ap- 
proximately 50% division of the group into 
those with and those without patterns. If such 
follow-up, or any similar follow-up study, 
should show a relationship between pattern 
occurrence and chronicity of psychosis, the 
occurrence of one or more of these patterns in 
an individual at or before onset of mental ill- 
ness might take on important prognostic sig- 
nificance. 

It can be seen from the further analysis 
shown in Table 4 that the five members of the 
control group in whom these patterns were 
found had only one pattern each, while two or 
more patterns occurred in eight, or 13.3%, of 
the patient group. If these findings are con- 
irmed, the existence of more than one pattern 
in an individual may have considerable diag- 
nostic significance, since such a finding occurs, 
thus far, only in patients on whom a psy- 
chiatric diagnosis of schizophrenia was made. 
Analysis of Table 5 reveals the additional sig- 
nificant finding that the combination of Pat- 
terns A and D (with or without other pat- 
terns) occurs in six of the eight patients with 
two or more patterns. From this combination, 
a larger pattern composed of five variables is 
at once apparent. This “AD” syndrome can be 
described as follows: 

Characteristics: An: High; Sex: High; 
F+%: Low; P: Low; A%: Low. 

Frequency: Controls: 0 (0.0%) ; Patients: 
6 (10.0%). 

It is of interest that six of the nine patients 
who show the estrangement, or generalized re- 
duction of effective contact with reality, 
characteristic of Pattern D also show a pre- 


occupation with the body and with sexual 
anatomy. Thus, the five-variable cluster seen 
in the AD combination would seem to repre- 
sent a turning of interest from the environ- 
ment to the body, leading specifically to a pre- 
occupation with the genitalia and other body 
organs. This total syndrome may represent a 
real regression to a level of adjustment an- 
alogous to an earlier stage of development. 
Because of the severe withdrawal implied by 
this constellation, one might expect that the 
six patients showing the AD syndrome repre 
sent more extensive personality disintegration 
than patients showing only one pattern or no 
patterns. 

The conclusions to be drawn from the re- 
sults of this analysis must be limited by the 
following considerations. First, it should be 
noted that the patterns isolated in this investi- 
gation cannot be regarded as an exhaustive 
enumeration of Rorschach test syndromes 
associated with schizophrenia, inasmuch as 
only twenty of the more important scoring 
categories were used in the analysis. 

Secondly, it should be recalled that the 
groups on which these patterns were tested 
were the same groups from which they were 
derived. Since the patterns still lack cross- 
validation, they must be regarded as tentative 
until confirmed on new groups. They have 
been described and made available to facili- 
tate the further research upon which the de- 
termination of their diagnostic and prognostic 
value must rest. In this connection, it is to be 
emphasized that very careful selection of popu- 
lations is the essence of a fair test of the valid- 
ity of these Rorschach patterns. If any or all 
of the patterns stand the test of such further 
research, they might then have clinical useful- 
ness in early detection of a schizophrenic pro- 
cess or as an aid to determining the gravity of 
an initial psychotic episode. 


Summary 


A pattern analysis of Rorschach scoring 
summaries was undertaken with the object of 
identifying certain quantitatively expressed 
syndromes or combinations of Rorschach vari- 
ables which are associated with schizophrenia 
and are nonoccurring or extremely rare in 
normals. Comparisons were based on a patient 
group of 60 adults with presenting diagnosis 
of schizophrenia and a representative control 
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group of 157 gainfully employed adults. A 
method of expressing moderate deviations in 
Rorschach variables by means of three-category 
scales and of seeking specific discriminating 
patterns of deviations by means of a systematic 
inspection technique is described. 

Five tentative patterns are reported, which 
appear, on the basis of this analysis, to be 
associated with schizophrenia. Their clinical 
and theoretical implications are discussed. 


Received March 4, 1952. 
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An Inverted Factor-Analytic Study 
of Schizophrenics 


Wilson H. Guertin 


Beatty Memorial Hospital 
W estville, Indiana 


In a previous paper by this writer [2], a 
conventional factor analysis of schizophrenic 
symptoms was reported. The study disclosed 
six factors which were important in explaining 
or synthesizing the symptom-data gathered. 
These six factors were proposed as concepts 
which could be utilized in exploring the schizo- 
phrenic domain more effectively than present 
subtype classifications. 

It was apparent that very little could be 
said about the schizophrenics as individuals 
since the conventional type of factor-analytic 
approach does not provide such information 
readily. Two procedures for further investi- 
gation were suggested at that time, one of 
which will be reported here. 


Burt’s inverted factor-analytic technique 
[3] studies the clustering of individuals rather 
than the clustering of symptoms. With such an 
analysis comments are more or less confined to 
individuals rather than symptoms, and the 
potential adequacy of typing people in classifi- 
catory terms can be determined. The corre- 
lations in the intercorrelation matrix are be- 
tween people instead of between behavior 
variables as in the conventional type of analy- 
sis. Beyond this point any method of factor 
analysis can be utilized to discover the factors 
which will suitably explain the intercorrelation 
matrix. The resultant factors will then repre- 
sent ideal types of individuals among those in- 
cluded in the study. 


Procedure 


The reader who is interested in a more de- 
tailed description of the procedure is referred 
to a previous paper [2] or the original report 
[1]. The subjects in this study were 20 diag- 


nosed schizophrenics on the admission ward 
of one state hospital. Twelve white females 
and eight white males were employed in this 
study and were selected so as to provide a good 
sampling by diagnosis from the larger group 
of 100 subjects utilized in the previous study. 
Some attempt was made to keep the paranoid 
and mixed types down to a reasonable number 
since they predominated. Of these subjects 
nine were diagnosed mixed type, five paranoid, 
five catatonic, and one hebephrenic. 


The original criteria for utilization in the 
sample were largely by exclusion since it would 
be circular to use schizophrenic symptoms as 
criteria in an empirical study of schizophrenia. 
These criteria were such as to exclude neuro- 
logical and involutional problems. 

The symptoms to be studied were abstracted 
from discussions of schizophrenia in. two stand- 
ard psychiatric textbooks. A total of 77 rele- 
vant symptoms was used in the collection of 
data but the frequencies of occurrence were too 
few for some, so that the final number em- 
ployed in calculating intercorrelations was 
only 52. The symptoms employed were as 
follows: 


Destructive 
Aggressive-combative 
Disagreeable-unpleasant 
Erratic activity level 


Poor time orientation 
Poor recent memory 
Systematized delusions 
Concern over right 
and wrong 
Disturbed sleep 
Noncooperative 
Lack of social graces 
Bizarre delusions 
Cries at times 
Visual hallucinations 


Guilt feelings 

Sex concern 
Underweight 
Over-religious 
Delusions of persecution 
Suspicious 


1The author wishes to acknowledge his indebted- 
ness to A. P. Bay, M.D., Superintendent of Man- 
teno State Hospital, where the data was gathered. 
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Concern over inventions 
Retarded movements 
Low activity level 
Seclusive 

No poverty of ideas 
Poor attention 
Flattening 

Heedless of needs 

Poor appetite 
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Delusions 
Misinterpretations 
Somatic delusions 
Feelings of reference 
Somatic concern 
Persistent inappr. mood 
Blunting of ties 
Auditory hallucinations 
Perseveration of phrases 





Mood-environ. 
disharmony 
Preoccupied 


Poor place orientation 


Superior-grandiose 
No narrowing of interests Mood-ideas disharmony 
Lack of ambition Manneristic 

Perplexed 
Silly 


Incoherent-irrelevant 


Concern over politics 
Does not work 
Vague abstract terms 


The subjects were rated by this investigator 
for the presence or absence of these symptoms 
after one or more mental status interviews. An 
attempt was made to rate all these symptoms 
at a single point in time so that a true cross- 
sectional picture would be presented. It was 
necessary to consult with one or more attend- 
ants in the gathering of some of these data. 
The symptoms are labeled briefly here for con- 
venience but more extensive definitions are 
given in the original report [1 ]. 


The correspondence between individuals in 
their symptom ratings was expressed in terms 
of tetrachoric correlation coefficients. The reli- 
ability of these coefficients depends upon the 
number of symptoms rather than upon the 
number of individuals, as in the conventional 
intercorrelation calculations. Since the number 
of symptoms is only 52, the reliability of these 
coefficients may be expected to be rather low. 
However, this would not affect the factorial 
structure since unreliability among these coefh- 
cients would merely serve to reduce commu- 
nalities and hence diminish the magnitude of 
the factor loadings. This is so because the 
general factorial configuration is independent 
of the reliability of the coefficients. 


The reduced intercorrelation matrix (20x 
20) was obtained by using the highest coeffi- 
cient in each column as an estimate of the 
communality. Trial factoring provided better 
estimates of communality so that the third and 
final factoring utilized communalities which 
were closer to the true ones. 

Three tight, relatively independent clusters 
were quite easily identified by a conventional 


cluster analysis and were extracted by the 
multiple-group centroid method of factoring. 
Rotation of the oblique matrix to orthogonal 
structure permitted the calculation of residu- 
als. 


Results 


The factor matrix for individuals is seen 
in Table 1. Many relatively large loadings are 
found, particularly for Factor 1. All 
factors extracted seem to be monopolar group 
factors, with no signs of a general factor. ‘The 
sum of the squared factor loadings shows that 


three 


the first factor is almost twice as important as 
either the second or the third. It is important 
to notice that if one regards those factor load- 
ings of .40 or greater (italicized) as important, 
that 10 of the individuals showed loadings on 
only one factor. Seven individuals showed a 
mixture one individual 
showed a mixture of three factors. Two sub- 
jects could not be described in terms of either 
of these three types since their factor loadings 
were all below .40. 


of two factors and 


Table 1 
Oblique Factor Loadings for 
Schizophrenic Individuals 


Subj. Factor Subj. Factor 

No. 1 2 3 No. 1 2 3 
1 14 14 65 11 58 -.28 .20 
2 14 95 ~ «38 12 we S33 
3 47 =D 16 13 53 37 62 
+ -.03 -.27 -.38 14 81 12 14 
5 01 68 2% 15 49 45 716 
6 01 -.31 05 16 39 87 25 
7 47 ~ =A 19 17 -.05 34 8664 
8 31 24 80 18 66 -03 -40 
9 48 A12 = «13 19 ai # & 
10 aT *@ A 20 “Sa 1 St 





Rotating the oblique matrix to an ortho- 
gonal structure produced relatively little 
change in the configurational pattern of the 
matrix since the original factors were relative- 
ly independent. The highest correlation be- 
tween oblique factors was .35 between Factors 
2 and 3. The total sum of the squared factor 
loadings accounts for 90 per cent of the esti- 
mated communality, while 58 per cent of the 
total variance of the matrix is accounted for by 
the three factors. The estimated communality 
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accounts for 64 per cent of the total variance 
in the reduced intercorrelation matrix. 

Inspection will disclose the symptoms which 
best characterize each type of schizophrenic. 
The following symptoms are those which the 
three most typical individuals all displayed and 
that did not appear as features of either of the 
other two types. 

Type 1 symptoms were: Delusions of perse- 
cution, suspicious, feelings of reference, de- 
lusions, somatic concern, misinterpretations, in- 
coherent-irrelevant, flattening, no persevera- 
tion, good recent memory, no lack of ambition, 
no low activity level, no blunting of ties, and 
cooperative. 

‘Type 2 symptoms were: Perplexed, poor 
recent memory, not incoherent-irrelevant, no 
somatic delusions, no auditory hallucinations, 
no misinterpretations, not preoccupied, not 
silly, no mood-ideas disharmony, no delusions 
of persecution and no persistent inappropriate 
mood. 

‘Type 3 symptoms were: Preoccupied, silly, 
mood-ideas disharmony, place oriented, not 
perplexed, no guilt, not concerned over right 
and wrong. 


lable 2 
Intercorrelations between Symptom Factors 
and Type Factors 


Type Factors 


1 2 3 

Symptom Con- 

Factors Paranoid fused- Hebephrenic 

Withdrawn 

Excitement 

hostility —.23 —.22 -.07 
tetardation & 

withdrawal -.25 30 -.23 
Guilt- 

conflict 12 .22 -.20 
Confused- 

withdrawal -49 48 -.18 
Persecuted- 

suspicious 77 -.50 -.50 
Personality 

disorganization .25 —A0 55 





Table 2 shows the correlation between con- 
ventional symptom-factor loadings for individ- 
uals derived from a previous study of symp- 
toms [1, 2] and the factor loading by types. 
The individual loadings on the symptom 


factors were scores obtained by weighting the 
presence or absence of significantly loaded 
symptoms in the subject by the squared factor 
loading of the symptoms on the given factor. 
Thus each subject has six weighted score sums, 
one for each symptom factor. In effect, ‘Table 
2 defines the three types in terms of symptom- 
factor relationships. These correlations were 
based upon 18 subjects, subjects 4 and 6 being 
excluded since they were different from the 
others and would have to be considered sepa- 
rately. To provide for a better linear relation- 
ship between type-factor loading and symptom- 
factor loading index, the type-factor loadings 
were transformed by squaring them before 
calculating correlation coefficients. By plotting 
a person’s symptom-factor loadings the type 
can be predicted without direct reference to 
symptoms. 


Discussion of Results 

First, the factor matrix should be examined 
for evidence bearing on the ever present 
question of “Is there such a thing as schizo- 
phrenia?”’ No suggestion of a group factor 
which could be called “schizophrenia” is dis- 
closed. To the contrary, the types separate 
rather clearly with relatively little overlap. As 
suggested in the original report [1], it is possi 
ble that an individual may shift from one type 
to another in the course of time and thereby 
justify the use of the general classification, 
schizophrenia. However, this hypothesis must 
be experimentally demonstrated. 

Secondly, are some “schizophrenics” enough 
like one another to be classified under a sub- 
category? The factor matrix suggests that 
some individuals are very similar to others 
with respect to the schizophrenic symptoms 
studied. Ten, or one-half the sample, could be 
classified as one or another of the types. Six 
others could be classified as “mixed type” with 
features of two types, while only one had 
large loadings on all three factors. The two 
individuals who did not show loadings on any 
type were quite different from any of the other 
subjects. They were catatonics in the excite- 
ment stage with markedly bizarre delusions. 
Further factoring would have disclosed the 
catatonic excitement as a fourth type. The 
justification for subtype classification seems 
clearly demonstrated. The matrix indicates 
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that it would be well to use the classification 
of “Mixed type” with specified components 
for many individuals. 


Thirdly, what is the justification for using 
current diagnostic categories? Rather general- 
ly the obtained data confirm the usefulness of 
paranoid, simple, and hebephrenic subtype 
classifications. Further inspection brings out a 
fourth type—catatonic excitement. A possible 
fifth type, catatonic stupor, has been excluded 
from the sample because of the necessity for 
using subjects who could give verbal infor- 
mation about themselves. The currently used 
classification of mixed type seems important 
particularly in describing those individuals 
who show paranoid and hebephrenic features. 
Other mixed types proved to be relatively 
rare for this sample. 


The estimated communality, which consti- 
tuted 64 per cent of the total variance of the 
intercorrelation matrix demonstrates that 
“schizophrenics” are more similar than they 
are unique. With a larger N this communality 
would be still higher. Accounting for 90 per 
cent of this common variance by only three 
factors (types) is a very encouraging finding 
to the nosologist. The types disclosed are as 
follows: 


Type 1 (Paranoid )—Highly delusional and 
outgoing, with relatively intact thought pro- 
cesses aside from delusional aspects. This type 
conforms in all respects to the well-recognized 
paranoid schizophrenic. The only difficulty in 
identifying individuals of this type seems to 
be in the concurrence of Type 1 and 3 
characteristics in the same individual. This, 
incidentally, is the only concurrence of types 
that seems to be very important in leading 
to “mixed type” diagnosis. 

Type 2 (Simple)—Confused withdrawal 
with thought confusion which is crippling to 
the extent that they cannot compete in society 
and hence are forced to withdraw. In spite 
of thought impairment there is no disorganiza- 
tion of the personality leading to shows of in- 
appropriate effect. Withdrawal is more pro- 
nounced in this type than the other two. This 
type may conform to simple schizophrenics 
or mild catatonic stupors depending upon 
how these classifications are applied. For classi- 
fication as Type 2 an individual should be 
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withdrawn, show intellectual confusion, no 
hebephrenic silliness, and very little delusional 
thinking. 

Type 3 (Hebephrenic)—The chief char- 
acteristic is inappropriate show of affect. The 
individual of this type has relatively normal 
functions, but they are inappropriately demon- 
strated. Withdrawal and delusions are absent. 
While this type looks much like the hebephren- 
ic in Kraeplin’s classificatory system, the 
differentiae are fewer. This undoubtedly ex- 
plains the occasional concomitance with the 
Type 1 factor. Kraeplin’s hebephrenic type 
is defined so rigorously that one can hardly 
be a “hebephrenic” and a “paranoid” at the 
same time. 

Experimental graphic estimates of the type 
from known symptom-factor loadings for the 
individuals suggest that a more thorough 
study can establish meaningful “schizophrenic 
types” and define them in a practical fashion 
by symptom-factor loadings which are ulti- 
mately in terms of rated symptoms. 


Summary 


1. In a previous study the writer reported 
six symptom factors which were important in 
explaining the symptom data gathered on 100 
schizophrenics. Little could be said about the 
individual schizophrenics, thereby indicating 
the need for an inverted factor analytic study 
of the individuals. 


2. The present study presents the results 
of an inverse factor analysis of 20 schizo- 
phrenics who were rated on 52 symptoms. 

3. Multiple-group factoring disclosed three 
relatively independent factors which account- 
ed for 90 per cent of the estimated communal- 
ity. Type 1 corresponds nicely to the paranoid 
subtype, Type 2 represents a confused-with- 
drawal syndrome, and Type 3 embodies hebe- 
phrenic silliness. 

4. Ten subjects showed a loading (2.40) 
for only one type factor, while seven showed 
a mixture of two factors, and one had a load- 
ing for all three factors. The two cases which 
showed no loadings proved to be in catatonic 
excitement and would produce an additional 
factor in a more complete study. 


5. Classificatory implications were as fol- 
lows: (a) Present subtyping is supported in 
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general. (4) No support was found for a gen- 
eral factor “Schizophrenia.” (c) It should be 
possible to characterize individuals as being 
of one type or another much of the time. (d) 
The most frequent combination of types in 
mixed diagnosis was hebephrenic-paranoid. 
(e) Currently employed differentiae seem to 
be too stringent especially in the case of the 
hebephrenic. 


Received February 8, 1952. 
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The Use of the MMPI Profile for Prognosis and 
Evaluation of Shock Therapy’ 


Marvin J. Feldman 


University of Buffalo 


Hathaway and McKinley [8] originally de- 
signed the Minnesota Multiphasic Personal- 
ity Inventory (MMPI) to serve as a diagnos- 
tic tool for a number of separate clinical syn- 
dromes. The empirical findings of Schmidt 
[11] and Gough [3], among others, indicated 
the usefulness of various patterns of scores on 
the subscales as an aid to diagnosis. Harris 
and his associates [5, 6, 7] broadened the scope 
of profile analysis on the MMPI by discover- 
ing typical patterns of scores related to prog- 
nosis. With slight variations (chiefly in the 
magnitude of the scores) their findings were 
consistent for such diverse groups as neurotics 
treated by brief psychotherapy and psychotics 
treated by various types of shock treatment. 
The pretreatment scores of groups who subse- 
quently responded poorly to therapy in contrast 
to groups who responded favorably contained 
higher average scores on the psychopathic de- 
viate (Pd,), psychasthenia (Pt), and schizo- 
phrenia (Sc) scales. These signs were found 
in all groups studied; other prognostic signs 
were educed which were more specific to a 
particular group. Subsequently other studies 
[1, 4, 10] presenting data on the prognostic 
value of the MMPI profile have appeared in 
the literature. The discrepancies in the results 
of these various studies are marked. Without 
some clarification of these divergent findings, 
the prognostic use of the MMPI profile 
would seem to be limited. 

Carp [1], for example, concluded that high 
scores on the psychotic scales of the MMPI 
were favorable signs for insulin treatment 
of schizophrenia. In his study the unimproved 

1This study was part of a larger study made pos- 


sible by a research grant from the United States 
Public Health Service. 


group had average scores well within normal 
limits on both Pt and Sc. Hales and Simon 
[4], using a small sample, reported findings 
somewhat similar to those of Carp, Pacella, 
Piotrowski, and Lewis [10], on the other hand, 
found that the average preshock scores of 
both improved and unimproved schizophrenics 
were so similar that the test could have little 
prognostic value. 

The writer believes that the results and 
conclusions of the present study may furnish 
a basis for reconciling the apparent contra- 
dictions reported by previous investigators. 
Thus this study may clarify the prognostic 


use of the MMPI profile. 
The Low Profile Group 


The discrepant results among the previous 
prognostic studies may be a function of a num- 
ber of factors such as: 


(a) unreliability stemming from small-sized 
samples, 

(4) unreliable ratings of improvement at 
the termination of treatment, 

(c) use of a broad twofold rating of im- 
proved and unimproved obscuring dif- 
ferences which might appear in a three- 
fold classification, 

(d) inclusion of records with all scores 
within normal limits concealing differen- 
tial patterns. 


Perhaps the last named factor is most impor- 
tant. In most aspects of measurement, proper 
scientific procedure demands the inclusion of 
all cases. Slavish adherence to such a dictum, 
however, can in some situations produce gross- 
ly misleading results. Experience with the 
MMPI leads the author to believe that misses, 


376 


we 


- 


l- 


[= 


of 
n, 
S- 


Ss, 


The MMPI for Prognosis and Evaluation of Shock Therap) 377 


i.e., records in a clinical group with all T scores 
below 70, should be excluded in any attempt to 
work out the average profile for a particular 
clinical group. The inclusion of the “misses” 
may very well lead to meaningless averages in 
the sense that the profile is not characteristic of 
the scores obtained by subjects in the group 
who have elevated profiles, i. e., the test “hits.”’ 
Inclusion of test misses in psychotic groups 
especially seems likely to produce distorted av- 
erages. An undue proportion of psychotic “test 
misses’’ seem to have all 7 scores within nor- 
mal limits and to be supernormal in just those 
aspects of the test which are important for 
prognosis, namely on hypochondriasis (Hs), 
Pt, and Sc. When the scales are not K cor- 
rected on the records of these test misses, their 
T scores on these scales tend to be well below 
50 and the raw scores, of course, are very 
low—often zero to two points on Hs and 
three to nine points on Pt and Sc. Thus the 
inclusion of test misses in the analysis of 
groups of psychotics is likely not only to low- 
er the average magnitude of the scores but 
also to alter markedly the shape of the profile. 
Carp’s data [1] illustrate the point made 
in the preceding discussion. In his unimproved 
group all J scores on Sc are below 70 or 
above 90. Further his unimproved group 
seems to contain an undue proportion of super- 
normal profiles judging from results on Pt 
and Sc where 14 of 22 cases have TJ scores 
below 50. 


In the present study 15% (40 out of 224 
cases) of the records had all scores within 
normal limits. These records were not included 
in the analysis. It seems probable that previous 
prognostic studies contained varying propor- 
tions of test misses thus leading to the discrep- 
ant results. ‘These cases are not “test misses” 
in the usual sense of the word. Rather the 
ability of some psychotic patients to achieve 
extremely low scores on those scales heavily 
loaded with bizarre symptomatic items ap- 
pears to be related to the personality structure 
of these respondents. It may be significant 
that in the present study approximately half 
of these cases were diagnosed as paranoid 
schizophrenics. These subjects seem to be 
worthy of intensive study in their own right 
to determine in what way they differ from 


schizophrenics with elevated MMPI profiles. 
At any rate their ego structure is sufficiently 
intact to permit them to deny most symptoms. 
Their suspiciousness works too well, however, 
for their supernormal profiles as described 
above allow the clinician to differentiate 
these cases from neurotic test misses who are 
much less likely to have such consistently 
low scores on Hs, Pt, and Sc. 


Selection of Cases 

The present study included the MMPI 
records of all patients at the Langley Porter 
Clinic who were given a test before shock 
treatment between the years 1943 and 1948. 
This study contained some of the same records 
used in the studies by Harris, et al. [5, 6], 
but this sample is much larger than in any of 
the previous studies. Also an attempt was 
made to eliminate some factors which might 
lower both the validity and the reliability of 
the data. Thus a number of cases were elimi- 
nated from consideration because of the 
following restrictions: 

1. As already mentioned, records were not 
used which did not contain at least one T 
score of 70 or over (other than masculinity- 
femininity, Mf). 

2. Psychiatric ratings of improvement were 
obtained both at the termination of treat- 
ment and in follow-up periods. Cases with 
inconsistent ratings were discarded, e. g., re- 
covered at the end of treatment but unim- 
proved six months later. 

3. Records were also eliminated which 
contained excessively high scores on two in- 
dices of internal validity, a JT score over 70 
on the ? index and a raw score over 19 on 
the F scale. 

The final sample contained 184 cases tested 
before treatment was instituted. Post-treat- 
ment records were also available for 56 
cases. The preshock records were divided into 
three groups called recovered, improved, and 
unimproved. The groupings were based upon 
psychiatrists’ ratings of improvement at the ter- 
mination of treatment and upon follow-up 
reports of at least six months duration. The 
use of a threefold classification of improve- 
ment as well as the use of follow-up studies 
would seem to enhance the possibility of 
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achieving greater validity and reliability in this 
study. 

The data presented in Table 1 indicate 
that the groups used were not unusual in any 
respect. The unimproved group is somewhat 
younger while the recovered group tends to 
be somewhat more intelligent. A more de- 
tailed summary of the personal characteristics 
of the groups is presented elsewhere [2]. 














Table 1 

Personal Characteristics of the Prognostic Groups 
Average 

Grot Vy Aver- Sex preshock 

y ) i . ° 

P age M F _intelligence* 

Age N Mean 

Recovered 82 38.0 36% 64% 48 111 

Improved 40 34.5 28 72 14 107 

Unimproved 62 31.5 48 52 30 103 

Allcases i184 35.0 39% 61% 92 108 
*Based upon four subtests; Similarities, Comprehen- 


sion, Block Design, and Digit-symbol of the Wechsler- 
Bellevue Scale. 


Table 2 presents the distribution by diag- 
nosis and type of shock treatment. It can 
readily be seen that the groups are not equal 
in these respects. While it would have been de- 
sirable to equate the prognostic groups in 
terms of age, intelligence, sex, diagnosis, and 
type of shock treatment,? such procedures 
were not feasible with the present-sized sam- 
ples. 


2Electric shocks (ES), electronarcosis (EN), and 
insulin (I) were used. 
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Table 2 


Distributions within Prognostic Groups by Diag- 
nosis and Type of Shock Treatment 





‘Type of shock 





Group N Diagnosis*® treatment 
Aff. Schiz. PN ES EN I 

Recov- 

ered 82 64% 29% 5% 75% 10% 15% 

Im- 

proved 40 38 42 20 3 @ & 

Unim- 

proved 62 16 54 27 41 18) 


41 


*Percentages do not add up to 100% because several 
cases were not assigned a definite diagnosis. 


Results on Prognostic Groups 


Two analyses were made of the preshock 
scores. First the scores of the prognostic 
groups were analyzed without respect to 
diagnosis. Then separate analyses were made 
to compute the average profile of the affective 
disorders* and schizophrenic groups within 
each prognostic category to determine if prog- 
nostic signs differ according to diagnosis. For 
the first analysis the mean JT scores on the pre- 
shock records of each of the three prognostic 
groups were tabulated without respect to di- 
agnosis. Table 3 indicates some systematic 
differences between the three groups on their 
MMPI profiles which may be of prognostic 
value. Figure 1 portrays these relationships 
graphically. The important findings may be 


8The affective disorders included cases diagnosed 
as manic-depressive, depressed; involutional melan- 
cholia; manic-depressive, manic. Most of the cases, 
however, were manic-depressive, depressed. 


























Table 3 
Comparison of Preshock MMPI Averages of the Three Prognostic Groups 
aie « 4@ FF & D By Pa Mt Pa Pt Se Ma 

xs xs x 68 x 5s a 8 x 8s x 8 x 5s x s x 8s 8 x s 
a a ee 82 62 85 659 6418 8416 7012 6411 65611 68 12 68 10 6618 52 12 
Improved...... 40 42 106 496 6612 8118 6912 7112 6512 6811 73 9 7618 6214 
Unimproved.. 62 42 116 498 6714 8516 6912 7412 6118 7812 7610 8014 61 13 
p* value from: 
Rec. vs. Imp 01 -06 01 01 01 01 01 
Rec. vs. Unimp 01 01 01 -O1 -05 -05 01 01 01 
i ite Ca ccctettercticentnttiinioen = -05 -05 





*p values based upon critical ratios. 


+tRaw scores given on L and F; T scores given on all other scales. 
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summarized as follows: 

1. The average profile of the recovered 
group is quite different from both the im- 
proved and unimproved groups who have 
quite similar profiles. 

2. Scores on the “neurotic triad,” Hs, 
depression (D), and hysteria (Hy) do not 
differentiate the groups. 

3. The most discriminating scales are Pdr, 
Pt, Sc, hypomania (Ma). 

4. The recovered group has significantly 
higher scores on L and K and significantly 
lower scores on F than either of the other two 
groups. 

Nine of the 12 scales (excluding only Hs, 
D, and Hy) differentiate significantly at the 
5 per cent level or higher between the re- 
covered and unimproved groups. Thus the 
present results are quite contrary to Carp’s 
findings [1] in that a poor prognosis for shock 
therapy accompanies high scores on those 


scales of the MMPI which reflect the most 
severe types of psychiatric disorders.* 


In the second analysis the mean 7° scores 
were computed on the basis of diagnosis with- 
in each prognostic group. Table 4 indicates 
that affective disorders and schizophrenics 
have markedly similar MMPI profiles when 
analyzed on the basis of response to shock 
treatment. In other words, the preshock 
MMPI profile of a recovered schizophrenic 
tends to be more similar to that of a recovered 
affective disorder than to an unimproved 
schizophrenic. However, 2 iew trends emerge 
from the data which might aid the clinician 
to make a more accurate prognosis when the 


It should be noted that the scales were not K 
corrected. At the time the data was collected, K 
was not in general use. Inspection of the data, how- 
ever, indicates that the use of K would reduce the 
differences obtained in the present study; hence the 
accuracy of predictions of response to shock ther 
apy would be reduced. 
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Fig 1. Average preshock MMPI profiles of groups rated recovered, improved, and unimproved after 


treatment 
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Table 4 


Difference between Preshock MMPI Means of Schizophrenics and Affective 


Disorders within Each Prognostic Category 











Recovered 





Improved 





Unimproved 











Sc Aff. Sc Aff. Sc Aff. 

N—24 N=52 N17 N15 N34 N=10 

x s x s x s x s x s or 
Li 5 ae. 4 2 3 2 4 2 3 2 4 2 
F 10 4 7 4* 11 5 10 5 14 6 10 508 
K 53 i) 55 9 47 6 49 6 48 9 49 5 
Hs 65 14 62 11 64 13 69 11 63 11 72 13** 
D 7 14 85 17 71 16 89 17* 80 17 93 20** 
Hy 67 11 71 11 65 12 72 11 65 11 71 11 
Pd 66 12 63 11 70 14 70 10 78 12 70 13 
Mf 55 11 56 11 56 12 58 10 62 14 57 11 
Pa 74 12 65 12* 67 9 67 12 74 11 73 12 
Pt 69 11 67 10 68 9 76 7% 75 11 77 9 
Sc 71 12 65 13 75 13 76 11 83 13 75 12 
Ma 56 13 52 12 69 13 59 10** 63 12 58 15 








*Significant at the .01 level. 
**Significant at the .05 level. 


traw scores given on L and F; T scores given on all other scales. 


diagnosis is known. It appears to be an un- 
favorable prognostic sign for a patient diag- 
nosed as an affective disorder to be overly 
concerned with bodily functions as shown by 
high scores on Hs. Affective disorders score 
consistently higher than schizophrenics on the 
D scale in each prognostic category, but there 
is very little difference between the scores of 
the diagnostic groups on the Sc scale within 
each prognostic category. 


Discussion 


The present study presents fairly conclusive 
evidence in support of the prognostic value of 
the MMPI profile for shock therapy. The 
writer can only conjecture that the discrepan- 
cies in the results of this study with those of 
previous studies arise from differences in 
methodology chief of which is the inclusion 
or exclusion of the test misses. The study 
raises a question about possible personality 
differences between those patients who re- 
spond well and those who respond poorly to 
shock treatment. The overt symptomatology ap- 
peared extreme enough to lead to a psychiatric 
diagnosis of psychosis in most of these cases. Yet 
the average profile of the recovered group re- 
sembles that of less severely disturbed neurotics 
who responded favorably to psychotherapy 


[7]. It could very well be the case that pa- 
tients can manifest rather bizarre symptoms 
at times even though their underlying ego 
structure is fairly intact. From an analysis of 
the actual items on the MMPI which differ- 
entiate the prognostic groups, the writer has 
suggested elsewhere [2] that patients who 
recover with shock treatment perceive the pos- 
sibility of satisfactory interpersonal relations 
even before therapy while the converse is true 
of those who remain unimproved. 


This study also suggests that prognostic 
groupings cut across conventional diagnostic 
categories. At least for the most severely dis- 
turbed patients, a classification system based 
upon prognosis may effect a more homogen- 
eous grouping than one based upon diagnosis. 
If this assertion is valid, it would follow that 
studies of other variables in relation to the 
prognostic categories, e. g., scatter on the 
Wechsler-Bellevue, degree of concretization 
of thought, etc., would reveal higher corre- 
lations than can be demonstrated with diag- 
nostic groups. 


Evaluation of Treatment 


Previous studies using postshock MMPI 
profiles to evaluate the results of treatment 
agree fairly well [1, 4, 10]. Improved groups 
have average JT scores well within normal 
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Fig. 2. Average postshock MMPI profiles of groups rated recovered and unimproved after treat- 


ment 


limits on all scales, while unimproved groups 
tend to have as high or even higher postshock 
records as their preshock records. Some inter- 
esting results were obtained by Pacella, 
Piotrowski, and Lewis [10] who gave two 
postshock tests, the first a week after the end 
of treatment, the second four to eight weeks 
later. After one week their group of affective 
disorders had average T' scores below 50 on 
most scales but on the second testing their 
scores tended to increase somewhat. The 
schizophrenic group which improved, how- 
ever, had average T' scores around 50 on most 
scales on the first posttest which tended to be- 
come supernormal (around 40) on the second 
test. The schizophrenic group who remained 
unimproved had T' scores on the first posttest 
which matched their preshock profile fairly 
closely, but on the second posttest their scores 
became even higher. While the interpretation 
of these findings is not obvious in terms of 


concomitant behavioral changes, the results 
make it imperative for similar investigations 
to report the time of post-treatment testing. 
The data in the present study are too 
limited to warrant an extensive analysis. Only 
56 postshock records divided evenly between 
the recovered and unimproved groups were 
collected. The tests were administered to most 
patients about one week after the last shock 
treatment. The mean T' scores were computed 
for the two groups without respect to diag- 
nosis. Figure 2 illustrates that the postshock 
records differentiate the groups even more 
clearly than did their preshock tests. Only 5 
of the 28 recovered patients had T' scores of 
70 or over on their postshock tests. The 
highest 7’ score was 78 and in most of these 
five cases the D score was an isolated peak. 
In contrast 25 of the 28 unimproved records 
had at least one T score of 70 or more. Also 
the K score tended to be fairly high in the 
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recovered group. It may be that patients who 
are rated “recovered” after shock treatment 
still have severe conflicts which are denied 
and suppressed. If the records had been cor- 
rected for K, the average differences in the 
profiles would be smaller. Thus the uncor- 
rected profile can serve as a fairly valid index 
of response to shock treatment. It is unfor- 
tunate that posttest results were not obtained 
on the improved group. It can be surmised 
that they would retain their intermediate 
position, thereby increasing the difficulty of 
using the MMPI profile to evaluate response 
to shock treatment. 


Summary 


1. On the basis of post-treatment clinical 
ratings, the pretreatment MMPI records of 
184 patients were divided into three prognos- 
tic groups, recovered, improved, and unim- 
proved. 

2. The profiles of the three groups differed 
significantly on a number of scales. High 
scores on the psychotic, psychopathic deviate, 
and F scales accompanied a poor prognosis 
for shock therapy; high scores on the L and K 
scales accompanied a favorable prognosis. 
Scores on neurotic scales did not differentiate 
the groups. 

3. The patterns observed in the groups as 
a whole did not change appreciably when the 
groups were analyzed further on the basis of 
diagnostic categories. 

4. Post-treatment MMPI profiles reflected 
response to therapy quite accurately. 
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A Technique tor Objective Configural Analysis 
of MMPI Profiles 


Patrick L. Sullivan 


Veterans Administration Mental Hygiene Clinic’ 


and George S. Welsh 
Veterans Administration Hospital’ 
Oakland, California 


The one aspect of psychological testing 
about which clinicians have generally agreed 
is the necessity for taking a holistic attitude 
when evaluating protocols. The number of 
articles concerned with this and the number 
of references to it are legion. Such terms as 
“molarity,” “configuration,” “pattern,” and 
“the total personality” are used to express 
this point of view. What is more, it has fre- 
quently been possible to demonstrate that the 
configural approach yields greater predictive 
accuracy than does the more atomistic. 


For example, Gough’s [2] four judges 
were able to detect MMPI dissemblers with 
greater precision than was achieved through 
the use of either the F or K scales alone. 
Aaronson and Welsh [1] showed that judges 
were able to assign the averaged profiles of 
psychiatric groups to the proper diagnostic 
categories even though Rubin [9] had not 
been able to accomplish this with the same data 
using analysis of variance of the individual 
scales. More recently, Guthrie [3] has dem- 
onstrated that judges using the configural 
approach can accurately sort individual pro- 
files into diagnostic categories. The judges in 
Hanvik’s [4] study of low back pain were 
able to distinguish between “functional” and 
“organic” patients, as well as to classify the 
profiles into the standard psychiatric groups. 


‘Reviewed in the Veterans Administration and 
published with the approval of the Chief Medical 
Director. The statements and conclusions published 
by the authors are the results of their own study 
and do not necessarily reflect the opinion or policy 
of the Veterans Administration. 


One problem in the use of judges is that 
they have been found to vary widely in their 
ability to sort protocols. This has been dis- 
cussed briefly by Hanvik [4]. There have 
occurred, furthermore, instances [6] in which 
none of the judges participating was able to 
sort profiles at better than a chance level. 
From these observations it would appear that 
a method for objectifying some of the aspects 
of the configural analysis employed by judges 
would yield beneficial results in terms of con- 
sistency and accuracy. 

The problem of configural analysis has been 
most cogently dealt with by Meehl [7] and 
it was as an application of the argument 
presented by him that the present paper was 
conceived. Where Meehl, demon- 
strates the possibility of deriving differen- 
tiating measures from the patterning of items, 
the technique developed in this paper has for its 
basic unit the clinical scales of the MMPI. A 
further difference consists in the fact that here 
the ranks of the scales of two groups of 
persons are used, whereas Meehl used the 
direction of answers to single items. There 
is an essential similarity in that each approach 
seeks to derive empirical signs or 
which differentiate identified groups from 
suitable control groups. A discussion of the 
implications and applications of the technique 
will perhaps be more meaningful if the method 
is first explained and illustrated. 


The Technique 


As was stated above, the essence of the 
method is the comparison of the ranks of the 


however, 
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clinical scales of the MMPI’s obtained from 
two groups of individuals. In order to facili- 
tate this procedure, our first step was to con- 
vert the 7 scores of all the individual records 
to the coding system developed by Welsh 
[11, 12]. In this system, each of the clinical 
scales is assigned a number from 1 to 9, the 
order of the numbering corresponding to the 
left-right order of the scales on the psycho- 
grams, i. e., Hs, 1, to Ma, 9. Appropriate 
symbols are employed to indicate the T-score 
deciles in which each of the scales falls. The 
scale code numbers are arranged in order of 
descending elevation, with the symbols placed 
to the right of the scales in a given decile. In- 
asmuch as our method requires only the rank 
order, these symbols need not be used. 


Table 1 


Rank Order Tabulation of Two Hypothetical 
MMPI Profiles 


Subject A, Profile Code, 1'3 4-2 876/9 5 :+ 


Scale Pairs Sums 

















Subjects 
A BD. Sinai + - = 
1-2 +r = 
1-3 + + 
14 r= 
1-5 + + 
1-6 f- id 
1-7 + — 
1-8 + = 
1-9 + + 
a - + 
2-4 — + 
2-5 + + 
2-6 + + 
2-7 7 +> 
2-8 + + 
2-9 + + 
34 - = 
3-5 + + 
8-9 + + 





¢#The underlinings indicate that the scales are within 
one T-score point of each other, 1.e., are “equal.” 


Looking forward to the future clinical 
application of the technique, we did not use 
the actual numerical ranks of the various 
scales. It appeared that the amount of time that 
would be necessary for numerical manipula- 
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tion might outweigh the advantages gained 
by the use of the present procedure. Instead, 
rank comparisons were made with the use of 
plus, minus, and equal signs: the higher of 
two scales was assigned a plus, the lower a 
minus, while scales were considered to be 
equal if they were within one T-score point 
of each other. The actual comparisons of the 
ranks were made within each individual 
record (Table 1). The group sums of the 
pluses, minuses, and equals were then obtained 
for each of the scale-comparison pairs, of 
which there are n(n-1)/2, or 36 for the 
MMPI. If a check on accuracy is desired, all 
72 of the possible pairs can be made, but the 
second occurrence of each combination will 
show the signs reversed. Thus, if Hs is higher 
than D, the 1-2 comparison will be +, while 
the 2-1 comparison will be -. 


Tests of significance of difference between 
the two groups were then applied to the totals 
of the pluses, minuses, and equals for each of 
the scale comparisons. Those differences which 
separated the two groups in both the plus-plus 
and minus-minus directions at the .05 level 
were tentatively accepted as “signs.” Only 
plus and minus were considered because in 
this study the number of equals was usually 
negligible. They may, of course, be combined 
with either the pluses or minuses or used 
separately when analyzing data. 


The next step was to ascertain the cross- 
validity of the “signs,” for which groups 
similar to the first two were independently 
selected. The MMPI profiles were assigned 
scores based on the number of “signs” they 
contained and the difference between the mean 
“sign” scores tested for significance. As a 
final step, derivation of “signs” from our 
cross-validation groups was also accomplished. 
Only those differences attaining the .05 level 
in both sets of data were accepted as the final 
“signs.” The final “signs” are suggested for 
general use, for they represent the significant 
findings in two sets of data after “shrinkage” 
has been allowed for. 


An Illustrative Application 


In order to determine the efficacy of the 
technique, the MMPI profiles of 60 male 
Veterans Administration Mental Hygiene 


wy 
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Clinic patients were obtained.? Thirty of these 
were known to have gastric ulcers, while the 
remaining thirty had neither an ulcer nor any 
other psychosomatic disorder. The results of 
the analysis of the 60 profiles, showing only 
the comparisons of the Hypochondriasis scale 
with all others, will perhaps further clarify 
the procedure (Table 2). These 60 individuals 
will henceforth be referred to as the Criterion 
Group. 


Table 2 
Results of Comparisons of Ranks of Hs Scale 
with Other MMPI Scales Based on 30 
Ulcer and 30 Nonpsychosomatic 
NP Male Patients 








Differences 








Scale Pairs Ulcer Control 

4 —_ — + —= 4 - == 

1-2 ss tt 2 4 24 2 11* -11* 0 
1-3 23 a 3 i9.4 12” —s* -—4 

1 19 7 4 13 16 1 6 -— 99° 3 
1-5 23 6 1 13 15 2 10° -— 9°* -1 
1-6 25 4 1 19 9 2 6 - 5 -1 
1-7 18 10 2 - i gee — 79° -2 
1-8 21 . 2 7; wa 12® -11* -1 
1-9 23 4 3 19 9 2 4 - 5 1 
* and ** signify phi coefficients significant at the .01 


and .05 levels, respectively. 


As may be seen from Table 2, the following 
scale-pairs characterize the profiles of the ulcer 
group in both the plus-plus and minus-minus 
directions : 

1. 1 (Hs) higher than 2 (D); 

2. 1 (Hs) higher than 3 (Hy); 

3. 1 (Hs) higher than 5 (Mf); 

4. 1 (Hs) higher than 7 (Pt); 

5. 1 (Hs) higher than 8 (Sc). 


In addition, comparisons of the other scales 
with each other yielded three more tentative 
“signs” which identify ulcer patients’ profiles: 


6.2 (D) higher than 3 (Hy); 
7. 3 (Hy) higher than 4 (Pd); 
8. 3 (Hy) higher than 5 (Mf). 


Drawing again on the male population of a 
Veterans Administration Mental Hygiene 
Clinic, sixty more MMPI’s were obtained. 
Thirty of these were ulcer patients’ and thirty 

“The authors are greatly indebted to Dr. Alvin 


Scodel, Ohio State University, for making these 
records available. 


were contributed by unselected patients; the 
latter group differed from the first control 
group in that they were randomly selected, 
i. €., psychosomatic cases were not eliminated. 
This appeared to be necessary because the 
working situation usually requires differenti- 
ations of particular kinds of patients from 
among the patient population at large. These 
records, which comprise the Validation Group, 
were scored for the eight “signs” shown above, 
the results for which are shown in Table 3. 
By the use of a cutting score of 6, it is seen 
in Table 3 that 21 ulcer patients and 19 con- 
trol patients, or 67% of the total group, are 
accurately identified. The phi coefficient for 
this percentage of accuracy is 0.33 (p=.01). 


Table 3 
Results of Scoring for Tentative Ulcer “Signs” in 
MMPI Records of Validation Group 
(30 Ulcer and 30 Unselected 
NP Male Patients) 








Number of “Signs” Ulcer Group Control Group 
8 + 3 
7 21 } 13 11 15 
6 L 4 3 
5 6 8 
4 2 0 
3 9 1 19 a 
2 0 3 
1 0 2 
0 0 2 

N 30 30 

Mean 6.27 4.57 

SD 1.27 2.37 
DM 1.70 
SEm AY 
CR 3.47 
p < .001 





These records were also subjected to the 
same analysis as the original sixty records, with 
the following significant differences resulting: 


1 (Hs) higher than 2 (D); 
1 (Hs) higher than 3 (Hy); 
1 (Hs) higher than 4 (Pd); 
1 (Hs) higher than 7 (Pt); 
1 (Hs) higher than 8 (Sc); 
1 (Hs) higher than 9 (Ma); 
7.2 (D) higher than 3 (Hy); 


AYP wh = 


Thus, there were five relationships which 
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separated the ulcers from the controls in 
both instances: 


1. 1 (Hs) higher than 2 (D); 
2. 1 (Hs) higher than 3 (Hy); 
3. 1 (Hs) higher than 7 (Pt); 
4. 1 (Hs) higher than 8 (Sc); 
5 


. 2 (D) higher than 3 (Hy). 


Table 4 


Results of Scoring for Final Ulcer “Signs” in 
MMPI Records of Criterion (30 Ulcer and 
30 Nonpsychosomatic NP Male Patients) 
and Validation (30 Ulcer and 30 
Unselected NP Male Patients) 








Groups 
Number ts : sf 
of Criterion Group Validation Group 
“Signs” Ulcer Control Ulcer Control 
5 ( 5 1 4 (3 
4 22;11 9} 4 26418 146 
3 L 6 4 4 5 
2 2 f 5 3 (7 
1 8 6 21 i 4; 1 16 i 
0 0 1 0 2 
N 30 30 30 30 
Mean 3.23 1.93 3.70 2.50 
SD 1.34 1.26 94 1.45 
DM 1.30 1.20 
SEm 34 32 
CR 3.86 $.75 
P < .001 < .001 





Scoring all the records for the five ulcer 
“signs” yielded the distributions presented in 
Table 4. As will be noted, the mean differ- 
ences in each of the ulcer-control pairings are 
significant beyond the .001 level. Using a 
cutting score of 3, 71.6% and 70.0% correct 
identification is achieved in the Criterion and 
Validation groups, respectively. These percen- 
tages have respective phi-coefficient values of 
0.41 and 0.40, both of which are significant 
beyond the .01 level. 

As is also obvious from Table 4, the two 
ulcer and control groups were not combined 
and single means, critical ratios, etc. derived. 
The reason for not combining the control 
groups is apparent when it is recalled that they 
were selected in a somewhat different manner. 
Concerning the ulcer groups, they were not 
combined because computation of the means 
and standard deviations of the individual 
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scales revealed that the two groups contained 
significant dissimilarities.. Table 5 presents 
these data, in which the Depression, Psychas- 
thenia, and Schizophrenia scales are signifi- 
cantly lower in the Validation group than in 
the Criterion group. These differences indicate 
that the Criterion group was much more 
maladjusted in the sense of scale elevation 
than the Validation. Despite these fluctuations, 
it was possible to make accurate separations 
through the use of “signs.” 


Table 5 


Means and Standard Deviations of MMPI Scales 
for Criterion (30 Ulcer and 30 Nonpsychoso- 
matic NP Male Patients) and Validation 
(30 Ulcer and 30 Unselected NP 
Male Patients) Groups 











Criterion Validation 
Group Group 
Uleer Control Uleer Control 
Hs Mean 77.3 66.0 75.3 75.0 
SD 16.8 15.6 13.2 15.5 
D Mean 79.4 79.0 72.7 79.7 
SD 13.6 13.2 11.8 14.1 
Hy Mean 74.0 66.3 71.7 73.3 
SD 13.2 11.3 17.2 11.7 
Pd Mean 65.0 68.7 61.3 64.3 
SD 12.1 13.3 11.8 12.2 
Mf Mean 63.7 71.0 62.0 58.0 
SD 9.3 13.3 11.6 12.1 
Pa Mean 59.0 60.3 54.7 $7.7 
SD 9.5 10.8 10.0 10.4 
Pt Mean 73.3 72.0 64.0 71.7 
SD 13.6 14.3 13.9 14.7 
Sc Mean 69.0 72.7 60.0 70.0 
SD 15.8 13.9 14.8 16.7 
Ma Mean 60.3 59.3 54.7 61.0 
SD 15.6 9.1 10.6 12.8 





With the data contained in Table 5 in 
mind, we may address ourselves to another 
question, viz., is the assumption that inter- 
scale comparison is superior to intrascale com- 
parison justified? That is, might not the com- 
parison of experimental and control individ- 
uals’ Hs, D, etc. scores yield as good or better 
differentiation than the rank comparison 
method presented? This question lies at the 
heart of the “atomism” versus “configuration” 
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argument with which we are concerned. 


In order to determine the efficacy of the 
single-scale approach, distributions of the 
scores for each MMPI scale were made for 
each of our groups. Cutting scores which yield- 
ed the best possible separation between the ul- 
cers and controls within the Criterion and 
Validation groups were determined. For the 
Criterion group, three such cutting scores 
were derived which identified the ulcer pa- 
tients: Hs greater than 70, Hy greater than 
80, and Mf less than 70; for the Validation 
group, two scores were obtained: D greater 
than 80 and Sc less than 60. Scoring the Vali- 
dation group for the “signs” obtained from the 
Criterion group, and vice versa, yielded only 
chance differences between the ulcer and con- 
trol patients within those groups. The fact that 
none of the indices is repeated is further evi- 
dence contra-indicating the efficacy of this ap- 
proach. It must be noted, however, that such 
complete failure might not be the case were 
the successive groups more homogeneous than 
ours. 


Discussion 


It is presumed to be apparent from the fore- 
going presentation that the technique elabo- 
rated is concerned with the assignment of in- 
dividuals to classes of persons sharing some 
known commonality. Although we have pre- 
sented data on patients with gastric ulcers— 
easily detectable by other devices—the method 
of analysis is as readily applicable to a host of 
variables, such as dissembling, aptitude for 
teaching, readiness for therapy, etc. That is, 
in any situation which allows for the separa- 
tion of individuals on the basis of a given vari- 
able, it is possible to determine the existence 
of indices by the procedure outlined. 


The application of such an analysis does 
not, of course, restrict the appraiser of an in- 
dividual’s protocol to this kind of consider- 
ation, nor does it detract from an elaboration 
of the nuances of the idiosyncratic make-up of 
the tested person. It does, however, lay stress 
on a nomothetic approach to the clinician’s 
data. We would contend that such an attitude 
inheres in more of our clinical work than we 
might care to recognize. For example, it stands 
to reason that the judges in the study of 


Gough previously cited had some preconceived 
set of notions of what a dissembled record 
should look like, ana that they consistently 
applied these expectations to the records 
judged. Similarly, in day-to-day work we look 
for patterned relationships which correspond 
to indices of classes, either derived through 
experience or conjured up rationally. We are 
proposing, then, that not only does the clin- 
ician employ class inferences, but also that 
what he does can be objectified and shown to 
be the most fruitful way of handling his data. 
The empirical technique presented offers a 
means of checking on the authenticity of 
patterns which are thought to identify some 
trend, trait, or diagnostic group. 

To exemplify, it has become the custom to 
expect all psychosomatic patients to give 
MMPI profiles which contain the “psychoso- 
matic V,” i. e., Hs and Hy elevated over D 
in a synclinal relation. The analysis of the 
records of the particular psychosomatic group 
used in this study—ulcer—revealed that this 
patterning did not result, but that D was sig- 
nificantly more often elevated over Hy in the 
ulcer patients’ records. 

Another advantage of configural analysis 
is that every possible relationship is appraised 
for its differentiating power. Scales of inter- 
mediate elevation may prove to be as predic- 
tive as those of maximum elevation when the 
relationships between them and the other 
scales are scrutinized. 

It must be noted that rank comparisons do 
not give consideration to the absolute eleva- 
tions of the scales but only to their relative 
magnitude. Ideally, a method of analysis 
would give consideration to both of these 
factors, and the failure of our method to 
achieve this must be regarded as a definite 
disadvantage. One of the authors (G. S. W.) 
is attempting to develop a technique which 
will handle the problem in its entirety. Even 
with this limitation, the method of configural 
analysis presented here does, as we have seen, 
permit us to make meaningful identifications 
of profiles. 

Although the use of an objective pattern 
analysis with the MMPI is not new, it has 
not previously been systematically elaborated 
as a method which may be applied in a variety 
of clinical service and research settings. 
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Pearson [8], for example, writes of “tabu- 
lating a number of combinations of the 
MMPI scale values,” from which he derived 
three indices of probable response to electric 
shock therapy. Again, Harris, et al. [5] report 
the use of “prognostic criteria” which con- 
sisted of the relative elevations of certain 
MMPI scales in the records of a number of 
clinical groups. Rigorous scale comparisons 
utilizing all possible relative comparisons were 
not, however, reported. Hanvik [4] used 
pattern analysis of coded profiles but notes 
that “it could not be carried out in statistical 
terms.” 

Nor is the use of configural analysis limited 
to the MMPI. To cite one instance of possi- 
ble application, the Wechsler-Bellevue lends 
itself readily to such treatment. Presenting 
the test characteristics of clinical groups, 
Wechsler [10] couches the indices in terms 
of deviations from subtest means. If the pre- 
sented indices actually allow differentiation 
of the various clinical groups, first coding the 
scales and then subjecting them to configural 
analysis would conclusively verify this fact. 
A study employing this procedure, which is 
designed to determine the existence of typical 
Wechsler-Bellevue patterns for good and poor 
readers in a primary-school population, is 
now being conducted by graduate students at 
the University of California, Berkeley.® 


Summary and Conclusions 


A technique for objective analysis of psy- 
chological test profiles based on rank compari- 
sons of the coded scales within individual 
records has been presented. Using the MMPI 
as the sample test with a group of patients 
known to have gastric ulcer, it has been dem- 


8Ruth Kostlan and Martin Fink, personal com- 
munication. 


Patrick L. Sullivan and George S. Welsh 


onstrated that the technique can be used to de- 
rive “signs” which differentiate them from 
nonpsychosomatic mental kygiene clinic pa- 
tients. These “signs” were also shown to 
differentiate a validating population of ulcer 
patients from randomly selected psychiatric 
patients. The application to other tests which 
can be coded has been outlined. 


Received March 17, 1952. 
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Relationships Among the Szondi Test Items 


Frank J. Dudek and Harry ©. Patterson’ 


University of Nebraska 


This article reports the first of a series of 
studies designed to provide information about 
some of the relationships and consistencies 
among the Szondi test items. The Szondi Test 
is a relatively recent contribution to the area 
of projective techniques and psychodiagnostic 
tests. It consists of 48 pictures of individuals 
(of both sexes) representing eight different 
pathological categories.2 For purposes of ad- 
ministration the 48 pictures are divided into 
six sets of eight pictures each. Each set in- 
cludes a picture representing each of the eight 
diagnostic categories. Essentially, the task of 
the subject is to choose from each set of eight 
pictures the two pictures he “likes” and the 
two he “dislikes.” Choices from all sets are 
considered when interpretations are made. 
However, this study was not concerned with 
the test as a diagnostic device, and it is not 
to be interpreted as a “validity” study. For 
this reason more elaborate details of the theory 
and use of the Szondi Test are omitted and the 
reader is referred to Deri [2] for such a 
treatment. 


Effective utilization of this device would 
seem to require that the test show a kind of 
internal consistency. That is, it would seem 
important that the various “items” used for 
the same purpose be homogeneous and con- 
sistent in their function. Thus, the design and 
nature of these studies are intended to provide 
information of a fundamental kind about the 


1Mr. Patterson is now at Baldwin-Wallace Col- 
lege. The collection and tabulation of the original 
data were carried out by Mr. Patterson and are 
included in an unpublished master’s thesis at the 
University of Nebraska entitled, “A Study of 
Matching Behavioral Descriptions with Szondi 
Pictures.” This work was carried out in 1949-50. 
The data have since been reanalyzed by Mr. Dudek 
and it is this reinterpretation that is presented here. 


2The eight categories are: (H) Homosexual, (S) 
Sadist, (E) Epileptic, (Hy) Hysteric, (K) Cata- 
tonic, (P) Paranoid, (D) Depressive, and (M) 
Manic. 


relationships among the individual items of 
which the test is composed. The fact that the 
diagnostic value of the test as a whole rests on 
the pattern of pictures which are “liked” and 
“disliked” by the individual to whom the test 
is being administered suggests certain assump- 
tions. These might be stated as: (i) the 
pictures included in the test do represent and 
express certain differentiable personality quali- 
ties; or, in other words, the pictures within 
any given diagnostic category have certain 
characteristics in common which differentiate 
them individually and as a group from the 
pictures in other diagnostic categories; and, 
(ii) subjects do make discriminations or 
choices of some and these 
choices are in some way related to the person- 
ality characteristics of the subjects. ‘This study 
is concerned primarily with the first of these 
assumptions, and data were gathered in order 
to answer such questions as: (a) What do the 
various Szondi pictures mean to a group of 
subjects? (6) Will subjects, given an oppor- 
tunity to associate Szondi pictures with des- 
criptive passages of presumed pathological 
categories, make associations of some consistent 
kind? (c) Are the various pictures represent- 
ing the different categories relatively homo- 
geneous? (d) Do associations that subjects 
make (if these are consistent—i. e., greater 
than “chance’’) agree with the diagnostic cate- 
gories indicated by the Szondi Test? Various 
experimenters [1, 3, 4, 5, 6, 7, 8, 9], have 
attacked these problems in different ways. To 
a large extent these studies provide a body of 
rather consistent information. The present 
series of studies furnishes additional evidence 
to that rapidly accumulating in this area. 


consistent sort 


Subjects and Procedure Utilized in the 
Present Study 
One hundred subjects were tested for this 
investigation. This sample, while no random 
sample of any specified population, was not 
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Table 1 

Frequency of Association of Each Picture with Each Description 

H Pictures S Pictures 
Set 1 2 3 4 5 ae 1 2 3 4 5 6 z 
H i SF 2 ae”: Se oe 5 4 18 3 4 10 44 
S 27 5 12 8 1 a ww 5 4 10 35 69 
& £ in ee es ee See i 0 a 
2 Hy 1 5 6 8 2 4 26 6 1 il 1 2 5 26 
me K 7 3 0 2 0 2 14 1 0 2 4 12 3 22 
& P 4 9 3 7 2 “a 0 0 3 3 6 9 21 
& D 0 4 0 0 2 2 g 0 0 0 1 2 2 5 
3 M ) Ba Bowe 4 la oe 8 18 2 13 . 64 
aN, 26 pt ae 2 6 20 7 g 9 10: 59 25 6 117 
N, 16 mt wa Co Rm oe Re 34 12 11 13 4 94 
N, 2 3) «12 5 7 ‘ Ss Be 30 8 7 7 2 78 
N, 3 «(19 ‘ *».@ 2 3 8 7 5 3 2 28 
z 100 100 100 100 100 100 600 100 100 100 100 100 100 600 

K Pictures P Pictures 
Set 1 2 3 4 5 6 1 2 3 4 5 6 = 
H 1 7 4 3 0 : ‘ss 5 8 0 5 4 5 27 
S 14 3 3 6:10 0 :-R 14 32 8 28 12 14 = 108 
& E 15 le =. a 3 @ g 3 5 11 10 53 
& Hy 7 11 19 6 27 6 76 8 2 2 0 5 7 24 
m K 12 33 8 9 5 1 68 16 13 11 10 8 15 73 
> P 25 11 33 7 15 i 92 10 il 0 2 18 27 68 
& D ae aS oe a a a 6 6.20 4 4 15 65 
5 M 4 0 4 20 a 8 7 0 7 9 5 36 
aN, 0 6 2 1 0 $ 612 6 8 46 20 15 1 96 
N, 0 0 2 8 1 36 47 0 3 3 11 4 1 22 
N, 0 0 2 5 2 36 45 0 1 1 5 7 0 14 
N, 1 6 3 11 0 0 21 1 1 6 3 3 0 14 
z 100 100 100 100 100 100 600 100 100 100 100 100 100 8 600 





drawn from a college population. Of the 100 
subjects only 12 were college men and women, 
the remainder representing various occupa- 
tions. The age range was from 20 to 56 years. 

The design of the experiment was simply 
to provide each subject with descriptive state- 
ments of the eight categories represented in 
the Szondi pictures, and with four descrip- 
tions of a “neutral” nature.* Each description 
was typed on a 3x5 card and the format of 
each card was such that all descriptions 
appeared about equal in length. The descrip- 


8These statements were drawn up by a panel 
consisting of Mr. Patterson and members of the 
Psychology Department staff at the University of 
Nebraska. It may be of interest for the reader to 
compare these with comparable descriptions used 
by Klopfer and Borstelmann [4], in a similar study. 


tions were as follows: (The name in parenthe- 
ses before each passage identifies the Szondi 
category presumably described. This name was 
not a part of the description available to the 
subjects. ) 


(Homosexual) A person who tends to behave 
like and have interests more usually associated 
with the opposite sex, and who is often found in 
groups made up of individuals of his own sex, 
and very much like himself. 

(Sadist) A pers-~ who is cynical about social 
activities, is not affected by personal or emotion- 
al appeals, and who is perhaps cruel and bru- 
tally rough in dealing with other people. 

(Epileptic) A person who is slow in reacting, 
who has changes of moods for no observable rea- 
son, but has periods of tension and reacts to a 
crisis with uncontrollable anger outbursts or ° 
blacking out. 
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Table 1 (Continued) 


Frequency of Association of Each Picture with Each Description 


























E Pictures Hy Pictures 

Set 1 2 3 4 5 6 1 2 3 + 5 6 z= 
H 25 1 3 4 8 3 (44 2 7 0 12 7 1 29 

S 9 7 . -» s&s 4 69 . ma 43 4 3 3 91 

bo E 2... 2. a ee. oe Se 6 2 2 66 
2 Hy 0 2 5 2 2 4 25 9 8 5 6 10 7 45 
oo K 3 15 34 20 2 . 7 7 6 17 12 12 91 
2 P ae a hie, i cae > oe 11 14 14 11 70 
% D °° 48° “Shree » ee > a > “On 
5 M 6 0 1 2 9 ma 2 0 2 2 3 11 
6 N, 7 1 9 + ss: @ 8 2 0 11 9 5 35 
N, 9 1 ee oe aoe Soe 1 0 0 3 4 2 10 
N, 8 0 0 0 17 on we 1 0 1 2 0 4 
N, 13 0 0 1 9 2% 49 3 0 0 8 18 2 31 

= 100 100 100 100 100 100 600 100 100 100 100 100 100 600 

D Pictures M Pictures 

Set 1 2 3 4 5 6 = 1 2 3 4 5 6 ~ 
H 10 7 1 9 7 = 3 2 7 8 4 3 27 

S 3 (if wo : =: = 0 0 2 5 3 3 13 

be E 7 1 9 16 6 57 1 0 0 7 4 i 23 
2 Hy 15 \ ey * -o-e 1 51 g 2 + 14 - 6 38 
& K ww’ Fs a ey se. 4 2 0 2 1 32Cti«4 
2 P 15 7 8 12 . we 0 0 1 17 1 8 27 
%& D 35 9 29 17 10 10 110 0 0 1 2 0 10 13 
5 M 0 6 1 y 3 3 22 3 a e 10 6 10 54 
6 N, o 21 5 2 se -&. 26 1 17 8 95 
N, 1 5 1 5 7 . Bm 34 i223 7 125 
N, 0 1 1 2 3 ln eae 17 35 1 124 
N, 4 10 2 a ae. ae 1 3 2 2 1 20 

= 100 100 100 100 100 100 600 100 100 100 100 100 100 600 





(Hysteric) A person who overreacts to emo- 
tional stimulation, who may be subject to crying 
and laughing spells, and who has a tendency to 
shift responsibility to other people. 

(Paranoid) A person who thinks he is being 
unfairly treated and who assumes that people 
watch his actions and talk about him more than 
they should. 


(Catatonic) A person who prefers solitude, 


does not care to have friends, and who is hardly 
known by his fellow workers. 

(Manic) A person who is very excitable, rest- 
less, moves rapidly from one activity to another, 
and who is enthusiastic about any scheme. 

(Depressive) A person who is usually sad, does 
not laugh easily, and who feels that life is full 
of hardships and disappointments. 

(Neutral,) A person who is even-tempered, 
who accepts both good and bad breaks without 
becoming upset, who is dependable, and who 


goes about his business in an alert and relaxed 
manner. 


(Neutral,) A person who assumes that every- 
one likes him, who likes to be with people, makes 
friends easily, and spends a great deal of time 
with friendly companions. 

(Neutral,) A person who enters into activities 
with enthusiasm, who gets a big kick out of do- 
ing things either alone or with others, and who 
enjoys life. 

(Neutral,) A person who is interested in “dat- 
ing,” but also enjoys the company of his own sex, 
who is understanding of the troubles of others, 
and who can’t bear to see anyone hurt. 


Each subject was tested individually. ‘The 


descriptive passages were placed before him 
in three rows in a predetermined order as 
follows: Row 1—E, N,, K, D; Row 2—N;, 
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M, N;, P; Row 3—S, Nz, H, Hy. The sub- 
ject was asked to read all of the descriptions 
and then each series of pictures was placed 
successively below the group of cards. For 
each series the subject was asked to match the 
pictures with what he believed to be the appro- 
priate behavioral description. He was told 
that more than one picture could be placed in 
a category and that first impressions should 
be used. Each series of pictures was placed 
before the subject in the order in which they 
appear for administration of the test. As each 
series was judged, these judgments were re- 
corded and the pictures were removed before 
the next series was administered. No time 
limit was imposed. 


Results and Discussion 


Table 1 shows the number of times each 
picture was associated with each descriptive 
passage (in terms of types of pictures). 

It is clear that “chance” does not explain 
the manner in which pictures are associated 
with verbal descriptive passages of the diag- 
nostic categories. Thus, for the summated 
data, and using marginal frequencies to deter- 
mine the “expected” frequencies, chi square is 
1600—a highly significant value for 77 de- 
grees of freedom (a chi square of 112 would 
be significant at the .01 level). There are 
numerous instances where the associations are 
much greater than one might expect “by 
chance”; and others where there is a marked 
lack of association of pictures with categories.* 


Another feature is the large differences in 
the number of times the various descriptive 
passages are used by the subjects. (The total 
times a passage was utilized can be determined 
by summing over all pictures. Thus, the H 


*The normal probability curve gives a satisfac- 
tory approximation to the binomial when fq is 
three or greater. Thus, for the summated data if 
one assumed that each descriptive category is “equal- 
ly likely,” ~ would equal 1/12 and one would ex- 
pect a mean frequency of mp — 50 in each cell. 
The variance, similarly, would be npg — 43.43. 
From this one might estimate that a frequency of 
65 or greater would occur less than once in one 
hundred times; and that one would obtain a fre- 
quency of 35 or less no more than one time in a 
hundred. There are, however, many frequencies in 
excess of 65 or lower than 35—and this leads to the 
same inference as obtained by means of the chi 
square. 
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category was used 335 times; § 514 times; 
etc.) The chi square determined between the 
“expected” distribution (each category being 
utilized 400 times) and the actually obseryed 
frequencies gives a value of 205. 
nificant for 11 df. Thus, one finds that with 
no restriction on the number of times a des- 
criptive category can be used, they are not 
utilized equally often. Apparently there occurs 
what might be termed a “response bias’’—the 
subjects tended to choose some types of des- 
criptions much more frequently than others. 
This “response bias’”’ was taken into account 
in determining the first chi square reported 
for the summated data since the marginal fre- 
quencies were used to obtain the “expected” 
frequencies. However, it means that any con- 
sideration of the number of “correct” associa- 
tions must also take this “response bias” into 
account. 


highly sig- 


Table 2 
Number of Associations of Each Type of Picture 
With Correct and Wrong Descriptions 
(Expected Frequencies in Parentheses) 


Type of 











Wrong 

Picture Association Association 
H 111 (41.875)* 489 (558.125) 

.) 69 (64.25) 531 (535.75) 

I 61 (47.25) 539 (552.75) 
Hy 45 (38.875) 555 (561.125) 

4 68 (59.50) 532 (540.50) 
P 68 (52.125) 532 (547.875) 
D 110 (58.625) 490 (541.375) 
M 54 (36.50) 546 (563.50) 

>» 586 (399.00) 4214 (4401.00) 
*The expected frequencies were determined taking into 
account the number of times each descriptive passage 


was used. Thus, since the “‘H” descriptive category was 
employed 335 times, one might expect 335/4800 of the 
600 judgments of the “H” pictures to be correct. 


If we consider the question of whether there 
are more “correct” associations than would be 
anticipated by chance alone, we find that for 
the over-all data there are 586 correct match- 
ings. One would expect but 4800/12, or 400, 
to be correct “by chance.” The excess is sig- 
nificant at the .01 level (chi square—94.3). 
Again, considering each type of picture sepa- 
rately we find that there is an excess of correct 
matchings for each type of picture. These 
data, tabulated for the computation of chi 
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square, are shown in Table 2. Chi square for 
this table is 193.9. While this chi square is 
highly significant, one important feature is the 
fact that the categories are not “homogen- 
eous” in their contributions to this value. 
Thus some pictures (viz., H and D) are 
correctly identified relatively frequently; 
while other types (viz., § and K) are correct- 
ly identified relatively infrequently. 


Table 3 
Number of Times Each Type of Picture was Asso- 
ciated with “Pathological” and 
“Neutral” Descriptions 
(Expected Frequencies in Parentheses) 





Placed in 





Placed in 
Type of “Pathological” “Neutral” 
Picture Category Category 
H 301 (399) 299 (201) 
s 283 (399) 317 (201) 
e 427 (399) 173 (201) 
Hy $15 (399) 85 (201) 
, 475 (399) 125 (201) 
P 454 (399) 146 (201) 
D 501 (399) 99 (201) 
M 236 (399) 364 (201) 
= 3192 (3192) 1608 (1608) 








In connection with this last result it is of 
interest to examine the utilization of the 
“neutral” categories. One might surmise that 
the neutral descriptions were too good “foils” 
for some types of pictures, and that this might 
explain to some extent why certain types of 
pictures were not correctly matched. In gener- 
al, this does not seem to be borne out. The 
data show that the neutral descriptions are uti- 
lized in proportion to their availability. Thus, 
one-third of the descriptive passages were “‘neu- 
tral’ —and these descriptions were employed 
1608 times, or almost exactly one-third of the 
time! On the other hand the various types of 
pictures were not associated with the neutral 
passages equally often. Table 3 shows the data 
tabulated in terms of the number of times each 
type of picture was associated with “patho- 
logical” and “neutral” categories. The H 
pictures, for example, were placed in the neu- 
tral categories much more frequently than 
would be expected by chance (and yet they 
were correctly matched to a significant de- 
gree). The Hy pictures were placed into path- 


ological categories very frequently. (But ap- 
parently they were not associated with the 
correct description much oftener than might 
be expected by chance. ) 

These analyses demonstrate that there are 
certain consistencies in the responses of sub- 
jects asked to match Szondi pictures with 
verbal descriptions. At the same time they 
show that the various kinds of pictures are not 
“homogeneous” in the sense that all can be cor- 
rectly identified equally often. Within types, 
too, there is a difference among pictures—some 
being correctly identified relatively frequently 
while others are misclassified consistently. 

From the frequent use of the “neutral’”’ des- 
criptive categories one might infer that the 
subjects do not associate certain pictures with 
pathological traits. Perhaps these pictures are 
not adequately expressive of the diagnostic 
categories. Another interpretation, of course, 
might be that the verbal descriptions used 
were inadequate; or that the basis on which 
selection or classification of pictures is made 
cannot be verbalized—i. e., is “unconscious.” 
It might be argued that requiring associations 
of pictures with verbal descriptions of this 
sort is unfair to the test in that this takes the 
items out of the regular test context. Another 
experiment, to be reported shortly, will pro- 
vide some information on this question. In 
this second study no verbal descriptions or 
neutral categories were used, and subjects 
matched pictures with pictures, thus eliminat- 
ing any need for verbalizing concepts or diag- 
nostic characteristics. 


Summary 


One hundred subjects were asked to match 
the 48 Szondi pictures with verbal passages 
describing, in a nontechnical manner, the diag- 
nostic categories represented by these pictures. 
Four “neutral” descriptions were also included 
in the group of verbal passages. The results 
showed: 

1. There were consistencies of responses, 
i. €., associations between pictures and verbal 
descriptions, which could not be attributed to 
“chance.” 

2. Given an opportunity to use the neutral 
descriptions with which to associate pictures, 
these types of descriptions were utilized in 
proportion to their availability. 
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3. For the pictures as a group there were 
more correct associations than would be ex- 
pected by chance. However, certain diagnos- 
tic categories are more frequently identified 
correctly than others. 

4. Pictures within categories do not seem 
to be consistent since some are much more fre- 
quently correctly identified than are others. 
Similarly, certain pictures seem to be associated 
with other categories than that to which they 
belong in a fairly consistent manner. 


Received March 17, 1952. 
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In recent years increased attention has been 
given to the problems of aging and the in- 
fluence of age on selected psychological attri- 
butes and functions. While several studies 
have investigated the possible relationship of 
increasing age to mental test performance [6], 
relatively few standard tests provide adequate 
age norms for adults or make any systematic 
allowance for this variable. This problem is 
of particular importance in appraising the 
mental functioning or impairment of clinical 
subjects. As some research has indicated [8], 
the memory impairment noted in certain psy- 
chopathological cases may be related in great 
part to the increased age of the patients. Two 
other studies [4, 7] with different hospital 
samples also have demonstrated that increased 
impairment of mental functioning as measured 
by some standard tests was associated with in- 
creasing age. Since such well-known scales of 
mental functioning as the Babcock-Levy [1] 
and the Shipley [10, 11] make no provisions 
for evaluating the effects of age on test per- 
formance, the utilization of the scales in 
clinical practice with older adults is open to 
possible criticism. 

A related problem concerns the use of vo- 
cabulary tests as a stable base line from which 
to evaluate efficiency of mental performance. 
Practically all tests of mental impairment are 
based on the assumption that the vocabulary 
function is peculiarly resistant to change in 
terms of such factors as age and psychopath- 
ology. As a result of this, a vocabulary test 
has been used as a measure of the optimum 
or previous level of performance and compared 


‘From the V. A. Regional Office, Chicago, IIli- 
nois, 


with tests of present mental functioning, 
usually tests involving speed of response, new 
learning, or concept formation. The discrep- 
ancy between the two types of test per- 
formance is then interpreted as an estimate of 
mental impairment or efficiency of function- 
ing. 

Research investigations of the relation of 
age to vocabulary level, however, have pro- 
duced somewhat conflicting results. Some have 
found that vocabulary level is stable at later 
adult years [3, 14], some have reported sta- 
bility through the sixth decade followed by a 
decline thereafter [9], one study secured a 
rise in vocabulary score up to age 55 with a 
subsequent decline [5], while others have re- 
ported an increase in vocabulary with increas- 
ing age [2, 7, 12, 13]. The fact that several 
studies have indicated either an increase or de- 
cline in vocabulary level at different ages is a 
finding of some importance in the light of the 
assumptions usually made for tests of mental 
impairment. This, however, is only one aspect 
of the broader problem of age norms for such 
tests. 


The Present Study 


The purpose of the present study was to 
evaluate the influence of age on the vocabulary 
and mental functioning of “normal” groups 
of subjects. It was decided to select subjects 
who were closely comparable in occupation 
and education, and by inference, of intellectual 
level. Thus, we might be able to ascertain to 
what extent the types of functions measured 
by a standard test of intellectual impairment 
are affected by the age of the subject. 

The test used was the Shipley-Institute of 
Living Scale for Measuring Intellectual Im- 
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pairment [10, 11].* This scale consists of two 
separate scales: a forty-item multiple choice 
vocabulary scale and a twenty-item completion 
abstraction scale. Norms are provided where- 
by the Vocabulary and Abstraction Scores of 
a subject can be converted into a Conceptual 
Quotient which gives one an estimate of men- 
tal impairment. Conceptual Quotients above 
90 are considered to be normal whereas values 
below 90 indicate varying probabilities of im- 
pairment. For example, a Conceptual Quotient 
of 85 is judged to be “slightly suspicious” with 
regard to mental impairment while a Con- 
ceptual Quotient below 70 is “probably path- 
ological.” 


Subjects 


The subjects used in this study consisted of 
three groups of 20 unmarried females, each 
group differing in age. The three groups were 
roughly comparable in terms of intelligence, 
education, and occupation. The age distri- 
butions were as follows: Group I, 20 to 30 
years with a mean age of 21 years, 11 months; 
Group IT, 40 to 50 years with a mean age of 
46 years, 4 months; and Group III, 60 to 70 
years with a mean age of 67 years, 2 months. 
The means are approximate since there was a 
strong reluctance on the part of subjects in the 
two oldest groups to reveal their exact age. 
In the interests of science, however, these 
subjects were prevailed upon to indicate the 
five-year interval in which their respective ages 
belonged. 














Table 1 
Educational Level of the Three Groups of Subjects 
Group Group Group 
Education I II Ill 
2 years of college 1 1 
3 years of college 1 2 1 
Bachelors Degree 18 11 14 
Masters Degree 1 6 a 
Totals 20 20 20 





In terms of occupation, the groups were 
divided in the following manner: retired 
teachers (Group III), teachers (Group IT), 
and persons graduating from a teachers college 


2Formerly called the Shipley-Hartford Retreat 
Scale. 
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who were about to begin the teaching pro- 
fession (Group I). All but five of the sub- 
jects had bachelors’ degrees with somewhat 
more of the older groups also having masters 
degrees (Table 1). All of the subjects were 
from a midwestern metropolitan area and were 
selected from lists of names obtained from 
local schools, teachers colleges, and a teachers’ 
association. Of those approached with regard 
to participation in the study only 11 refused 
to participate, and thus the selective factors 
are probably minimal. The test was admin- 
istered in the home of the subjects as part of 
a battery to study personality changes related 


to aging. All subjects cooperated unusually 
well. 


Results 


The mean scores and sigmas for the Vo- 
cabulary and Abstraction Scales and also for the 
Conceptual Quotients were computed for each 


Table 2 


Means and Sigmas of Various Measures on 
the Shipley Scale 











Group I Group II Group III 
Vocabulary Scores ae 
Means 31.30 34.85 37.55 
Sigmas 3.38 3.51 2.38 
Abstraction 
Scores 
Means 31.60 28.30 29.20 
Sigmas 4.08 4.11 4.53 
Conceptual 
Quotients 
Means 100.05 90.25 91.25 
Sigmas 8.60 8.76 9.78 





group of subjects. The findings are listed in 
Table 2. Inspection of the data reveals a 
definite increase in vocabulary score with in- 
crease in age, some superiority of the youngest 
group of subjects over the other two groups on 
the Abstraction scale, and a noticeable differ- 
ence in Conceptual Quotients in favor of the 
youngest group. Statistical comparisons be- 
tween the group means were then made utili- 
zing the ¢ test. 

The results of the statistical analysis are 
given in Table 3. Certain fairly distinct 
patterns are evident. The Vocabulary Score 
increases significantly with age, with the old- 
est group being significantly superior to the 
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Table 3 

Significance of Differences Between Group Mean 
Scores on Shipley Scale Determined 

by the ¢ Test 











Groups Groups Groups 
[&Il I& Ill II & Ill 
Vocabulary Scores 
t 3.169 6.578 2.783 
p 01 001 01 
Abstraction Scores 
t 2.481 1.717 .642 
p 02 10 50 
Conceptual Quotients 
t 3.475 2.953 .333 
t 01 01 50 


other two groups. Contrary to the general ex- 
pectancy that vocabulary is a relatively stable 
function, the present data reflect a positive 
relationship between vocabulary and age. The 
findings obtained with the Abstraction Scale 
are not as clear-cut as those obtained with the 
Vocabulary Scale. The only difference below 
the 5 per cent level of confidence occurs be- 
tween Groups 1 and II. However, the differ- 
ence between Groups I and III is close to this 
level of confidence. On the other hand the 
difference between Groups II and III, the 
two oldest groups, is not statistically signifi- 
cant. Furthermore, when one examines the 
mean scores, it is noted that the means for the 
two oldest groups are quite close together and 
differ more from the mean of the youngest 
group than they do from each other. In other 
words, if we look at the trends as well as the 
tests of significance, the two oldest groups are 
comparable in terms of the Abstraction Scale 
but function below the youngest group. This 
same trend is evident in the statistical com- 
parison of the groups with regard to the Con- 
ceptual Quotients. The two oldest groups do 
not differ significantly from each other, but 
both differ from the youngest group at the 
1 per cent level of confidence. 


The analysis of the test data thus indicates 
that the two older groups exceed the youngest 
group on the Vocabulary Scale but fall below 
the youngest group on the Abstraction Scale. 
As a result, the indicated level of mental 
functioning for the youngest group exceeds 
that of the older groups. Whereas only four 
subjects in Group I secure Conceptual Quo- 


tients below 90, nine and eight subjects in 
Group II and III, respectively, secure such 
Conceptual Quotients. Although the older 
subjects secure higher Vocabulary Scores than 
the younger subjects, in only a relatively few 
cases does this appear to be related to the 
greater number of below normal Conceptual 
Quotients among the older subjects. In most 
instances, the Conceptual Quotients are more 
directly related to lower Abstraction Scores. 
This is due to the fact that all Vocabulary 
Scores above 32 are interpreted in the same 
manner as are scores of 32 [11]. Consequently, 
in terms of Shipley’s scoring system, most 
cases are not unduly penalized because of high 
Vocabulary Scores. Nevertheless, approximate- 
ly twice as many subjects in the two older 
groups obtain Conceptual Quotients which are 
interpreted as being below normal than is true 
among the youngest group of subjects. 


Discussion 

The findings presented above agree essenti- 
ally with those reported by Sorenson [12], 
Sward [13], Rabin [7], and Christian and 
Paterson [2] in indicating some increase in 
vocabulary level with increasing age. Con- 
versely, they do not agree with other studies 
which have found either no rise in vocabulary 
after early maturity or a decline after the age 
of 60. Of possible significance is the fact that 
in the studies reporting an increase in vocabu- 
lary with increasing age, most of the subjects 
were above average in education, were en- 
gaged in some form of educational activity, or 
both. In the light of such results, Sorenson 
[12] has advanced the hypothesis that individ- 
uals who remain intellectually active may con- 
tinue to show gains in their verbal knowledge. 
Shakow and Goldman [9], on the other hand, 
interpret the rise in vocabulary with increas- 
ing age as indicating a greater selectivity 
among the older subjects with regard to in- 
tellectual status. However, when subjects have 
been equated in terms of similar education 
and occupation, there is no a priori reason or 
compelling evidence for believing that the 
older group is of higher intellectual caliber. 
From the data at hand we are inclined to give 
greater acceptance to the hypothesis of Soren- 
son, while also admitting that this view is open 
to further testing and verification. Additional 
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studies with groups of subjects selected in 
terms of various educational and occupational 
criteria may be of value in further clarifying 
this problem. 


The present findings also have some impli- 
cations for the evaluation of mental function- 
ing in clinical patients. The data indicate that 
younger subjects function somewhat more 
efficiently than older subjects on the Shipley 
Scale. Somewhat comparable findings have 
also been secured with hospital patients [4, 
7]. Consequently, since age appears to bear 
some relationship to indices of impairment, it 
seems likely that some provision for age norms 
in scales of impairment would insure a more 
meaningful appraisal of mental functioning 
in patients. 

As a concluding statement we would like 
to emphasize that the present group of subjects 
is a highly selected group intellectually, and 
other types of subjects conceivably might show 
different patterns of intellectual functioning 
with increasing age. From a clinical viewpoint 
it may be more worth while to study the 
effects of age on many highly selected groups 
of subjects than to study only a heterogeneous 
sample of subjects. 


Received February 20, 1952. 
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A Factorial Analysis of the Wechsler- Bellevue 
Scale Given to an Elderly Population’ 
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When one asks how the intellect ages, he 
is posing a difficult question. It seems reason- 
able that the various primary and second-order 
factors found in mental tests [7, 9, 10] will 
reach their maxima at different ages, will vary 
in the extent of their middle-age plateaus, and 
will vary in their rates of decline. It is im- 
portant to know whether our tests of these 
factors measure the same abilities at different 
age levels. Thus one might ask how verbal 
comprehension changes in later life. If the 
best measure we have of this factor in young 
adults, the vocabulary test, measures some- 
thing different in the elderly, we cannot say 
what happens to verbal comprehension in aging 
or perhaps in any condition in which one sus- 
pects intellectual functions may have altered. 

Balinsky [1] made one of the first attempts 
to determine what the Wechsler-Bellevue 
(W-B) group of tests [12] measures by a 
factor analysis of the scale in several groups 
of subjects ranging in age from 9 to 59 years. 
Unfortunately the Similarities and Vocabulary 
tests were not in the battery at the time his 
data were gathered. His analysis is thus based 
on nine rather than eleven subtests. Balinsky 
found that the average intercorrelation among 
the tests changed with age. It was .37 at age 
nine; it decreased in young adults and then 
rose to a high of .43 in the age range 55 to 
59 years. In the latter age group containing 
69 subjects, he reported three “psychologically 
meaningful” factors: reasoning, a performance 
factor, and a general factor. His reasoning 
factor was highest in the Arithmetic and Block 


1Grateful acknowledgement is due L. L. Thur- 
store for his suggestions in the analysis of the data 
and to Charlotte Fox Rogers and Jack Botwinick, 
who administered the tests. 


Design tests, the performance factor highest in 
the Object Assembly test, and the general fac- 
tor highest in the Comprehension and Informa- 
tion tests. Balinsky’s factors were obtained by a 
centroid factoring of the correlation matrix 
and the reference axes were rotated orthogon- 
ally. 

Goldfarb [6] also analyzed the Wechsler- 
Bellevue along with thirteen other measure- 
ments on 108 men aged 18 to 55 years. He 
extracted three factors by the Hotelling prin 
cipal-component method. He called his first 
factor a general factor, best identified as speed 
of mental operations. All the Wechsler tests 
except Information have a significant loading 
on the general factor. The second factor was 
called experience and had its highest loadings 
on the Information and Vocabulary tests. 
The third factor was not identified. 

It is clear that these studies did not yield 
homologous factors but they convincingly 
demonstrate that the W-B is a rather compli- 
cated measuring instrument and that it is diffi- 
cult to interpret a simple summation of scores, 
as in computing an IQ. 

The present study has two purposes: to 
determine what mental abilities are being 
measured by the W-B in an elderly group, and 
to align these factors if possible with the 
Thurstone factors. It seems desirable to use 
some standard nomenclature in identifying 
factors, and Thurstone’s terms are to be pre- 
ferred since they are based upon a number of 
analyses of sizable test batteries given to 
large groups of subjects. 

The W-B is hardly an adequate instrument 
for an experimental analysis of the age changes 
in the intellect. As a clinical instrument, how- 
ever, it has been given to large numbers of in- 
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dividuals and its wide use justifies an attempt 
to specify in some analytic manner what it 
measures in the elderly. It is also likely that 
its analysis can offer clues for more detailed 
investigations. A more thorough study of 
aging would administer a large number of 
well-studied tests such as the Thurstone tests 
of primary mental abilities [10] to subjects 
over a wide age range. 


Procedure 


Subjects. The subjects were white men and 
women aged 60 to 74 years inclusive. All sub- 
jects had four or more years of schooling and 
had English as their native language. Ali had 
hearing adequate to follow oral instructions 
and were able to read typewritten material 
(primer type, eight characters per inch). A 
total of 99 subjects was tested. The subjects 
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were in homes for the aged in the Baltimore 
area or in general hospitals convalescing from 
somatic disease presumably unrelated to men- 
tal performance. All subjects were given the 
tests individually by examiners trained in test- 
ing elderly persons. All tests were given in a 
single testing session unless the subject be- 
came fatigued, when an additional session was 
used to complete the testing. In addition to the 
elderly subjects described above, 31 hospital- 
ized mental patients with presumptive senile 
brain changes were tested [3]. These patients 
were also white and had four or more years 
of schooling and had English as their native 
language. The age range of the patients was 
60 to 70 years inclusive. 


Analysis of data. Product-moment corre- 
lations were computed between all tests and 
the resulting correlation matrix was factored 


Table 1 


Intercorrelations of the Eleven Subtests of the Wechsler-Bellevue 
in 99 Subjects Aged 60-74 Years* 








1 10 11 























2 3 4 5 6 9 
1. Information 70 38 60 72 81 56 65 47 34 58 
2. Comprehension 34 50 66 66 53 63 51 33 54 
3. Digit Span... 44 38 41 20 27 35 16 42 
4. Arithmetic.......... 63 62 44 51 57 24 50 
5. Similarities............ 71 57 60 56 45 60 
6. Vocabulary. i 51 67 49 28 59 
7. Picture Arrangement 67 53 45 66 
8. Picture Completion... 60 51 64 
FR 46 65 
10. Object Assembly...... 46 
11. Digit Symbol_ 
*Decimal point omitted. 
Table 2 
Intercorrelations of the Eleven Subtests of the Wechsler-Bellevue 
in 31 Senile Mental Patients Aged 60-70 Years* 
1 2 3 4 5 6 7 8 9 10 11 
I cs citecatanliahianbenadin 77 63 69 68 86 61 61 78 72 61 
eee 79 63 71 64 60 61 72 63 68 
SE eae 60 63 60 33 59 52 70 68 
re Ect hileanchconieneattiaieeree 36 56 ae 52 62 56 72 
ES eT 63 54 60 70 69 45 
eve eee 52 63 62 65 52 
7. Picture Arrangement. 64 70 51 40 
8. Picture Completion... 70 63 51 
Be ee 63 55 
19. Object Assembly.............._....... 49 
a = 





*Decimal point omitted. 
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by the centroid method. After the first factor- 
ing the resulting communalities were inserted 
in the diagonal cells of the original matrix and 
the matrix was factored a second time. Only 
trivial differences appeared between the first 
and second factorings. Four factors were ex- 
tracted and their reference axes were rotated 
to secure simple structure [11]. Some ambigu- 
ity appeared in the location of the plane for 
the first factor and for this reason two sets of 
rotations were determined. One result was ob- 
tained leaving the first reference axis orthogon- 
al to the other axes and a second result was 
obtained rotating the first reference axis into 
an oblique position. 


Results 


The intercorrelations of the eleven tests 
were rather high (Table 1) ; the median corre- 
lation coefficient was .53. Even higher inter- 
correlations were obtained in the 31 senile pa- 
tients where the median value was .63 (‘Table 
2). This agrees with Balinsky’s finding on 
somewhat younger subjects that the average 
intercorrelation of the W-B subtests rises in 
later life, and adds the further fact that in 
senile deterioration the correlations rise even 
further to a surprisingly high level for cogni- 
tive functions. 

As might be expected from such intercorre- 
lations, some of the centroid factor loadings 
are high. Four factors were extracted. The 
fourth factor residuals were small and would 
not justify further factoring.” 


There appeared a choice in the results of 
the elderly group in either leaving the first 
reference vector orthogonal or rotating it to a 
very acute oblique position. Both solutions are 


2To save printing costs, six tables and one figure 
supplementary to this article have been deposited 
with the American Documentation Institute, as fol- 
lows: Table A, Centroid factor loadings in the 
elderly group; Table B, Transformation matrix 
leaving first factor orthogonal; Table C, Cosines of 
angles between reference vectors of Table 3; Table 
D, Correlations between primary vectors of Tal.e 
3; Table E, Transformation matrix placing all 
vectors in oblique positions; Table F, Cosines of 
angles between oblique reference vectors of Table 
4; Figure 1, Factor plots of the four factors ex- 
tracted from the W-B given to the elderly popula- 
tion. Order Document No. 3676 from American Doc- 
umentation Institute, 1719 N St., N.W., Washing- 
ton 6, D.C., remitting $1.00 for 35 mm. microfilm 
or $1.20 for 6 x 8 in. photocopies. 


presented (Table 3 and Table 4). In the case 
of the orthogonal solution the first factor 
loadings are high and all tests show a signifi- 
cant loading (Table 3). In the oblique solu- 
tion the first factor loadings are not significant 
except for verbal tests (Table 4). However, 
in the oblique solution a large second-order 
factor would appear because the first factor 
would then correlate highly with Factors 2 
and 3 (Table 5). 


Table 3 


Rotated Factor Loadings in the Elderly Group 
Leaving the First Factor Orthogonal* 


l ests A B Cc D 
1 91 —()2 —07 03 
2 77 10 —(1 08 
3 45 03 41 —O(1 
+ 68 01 32 19 
5 80 00 05 33 
6 89 01 05 —4 
7 60 4 03 10 
8 72 37 —2 13 
9 55 38 38 29 
10 35 37 01 32 
il 66 l 31 00 
*Decimal point omitted. 
Table 4 


Rotated Factor Loadings in the Elderly Group 
with All Oblique Vectors* 


Tests A B ¢ D 
1 57 —vU1 —U7 03 
2 40 10 —U1 07 
3 16 00 41 —-u) 
+ 36 —(l 31 18 
5 51 01 03 33 
6 51 00 05 —03 
7 06 47 02 05 
8 20 39 —02 09 
9 03 37 37 25 

10 00 40 —O1 27 
11 —02 50 31 —05 





*Decimal point omitted. 











Table 5 
Correlations Between Primary Vectors of Table 4 
= elie Cc rr 
A 1.00 By 38 —.27 
B 72 1.00 13 —.03 
Cc 38 13 1.00 —.02 
D —.27 —.03 —.02 1.00 
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Interpretation of factors. The first factor 
is quite clearly the same as Thurstone’s 
Verbal Comprehension, Y [10]. This factor 
had its highest loadings in the Vocabulary, In- 
formation, Similarities, and Comprehension 
tests. Success on these tests apparently is 
largely a function of the amount of stored 
verbal information or meanings. Factors 1 and 
2 correlated quite highly (Table 5) suggest- 
ing that all eleven tests have a common factor 
in stored information. In the orthogonal solu- 
tion this fact appears as a first factor loading 
on all tests (Table 3). Highest loadings were 
in Vocabulary and Information tests and al- 
most account for all the variance of these 
tests. It appears that an oblique solution would 
place the first reference vector in an acute 
position and would yield a large second-order 
factor highly correlated with /, whereas the 
orthogonal solution shows loadings on all 
tests, highest in the / tests. Thus one could 
regard the results as indicating a second-order 
factor which would be identified as the mass 
of stored information possessed by individuals 
and correlating highly with the stored verbal 
information . In contrast, in the orthogonal 
solution one would regard the first primary 
simply as stored information. 

Factor 2 had loadings only in the so-called 
“performance tests” of the scale. All of these 
tests require visual rather than verbal processes 
in solution of the problems. Of the various fac- 
tors which might be present here, Perceptual 
Speed, Closure, and Spatial Relations, the 
element of Closure seems to be the common 
element in the tests. This would be the C, 
factor of Thurstone; Jones [7] found a simi- 
lar factor in the Stanford-Binet. The closure 
factor is the ability to fuse the perceptual field 
into a single percept. It should be pointed out 
that the C, factor was highly correlated, .72, 
with V in the oblique solution (Table 5). 
This suggests that stored information or 
familiarity with similar material is necessary 
in closure. In this factor, however, C, pre- 
sumably would be correlated with stored 
“visual meanings.” 

Factor 3 had loadings on four tests: 
Digit Span, Arithmetic, Block Designs, and 
Digit Symbol. The Digit Span test has been 


shown to measure a memory factor, M, but 
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has no loadings on other factors [10]. For this 
reason it appears desirable to pivot the inter- 
pretation of the factor on the content of the 
Digit Span test. It is clear that memory is in- 
volved in the Arithmetic and Digit Symbol 
tests. Less clear is the role of memory in the 
Block Design test, although even in this test 
the ability to remember the analysis of the 
patterns and the sequence of operations may 
be important in arranging the blocks. Factor 
M was also correlated with V in the oblique 
solution, .38 (Table 5). 

Factor 4 is most ambiguous not only because 
of low loadings but also because of the tests 
in which it is demonstrated: Similarities, 
Block Design, and Object Assembly. About 
the only feature these tests have in common is 
induction. This interpretation would seem 
plausible because the factor has its highest 
loading in the Similarities test, a test in which 
one has to tell how two objects are alike or 
similar. No great confidence, however, can be 
placed in either existence of the factor in the 
tests or its interpretation. 

Factor scores. “Factor scores” on V, C, 
and M were obtained for individual subjects 
in the elderly and senile groups. These scores 
were obtained by averaging the best tests of 
each factor, weighting the tests equally: 





: 21,2,6 27,8,10 
V score = ; C, score = ————-; 
3 3 
>3,4,11 
M score = “— ; 


In order to make the senile and elderly groups 
more comparable, a subsample of 50 subjects 
was drawn from the total of 99 elderly sub- 
jects. The median years of education was 7 
for the senile group and 8 for the elderly 
group. The median age was 65 years, range 
60-70 for the senile group and 64 years, range 
60-69 years, for the elderly group. 

The assumption is made here that a lower 
factor score in the senile group would reflect 
a deficit in the factor induced by brain disease. 
Since the W-B subtests were standardized to 
yield mean scores of 10, one regards any differ- 
ence between 10 and the mean of the present 
group as loss. Such loss however need not be 
in the factors named here. Only the difference 
between the means of the two groups can be in- 


cts 
1b- 
, 7 
rly 
ge 
ge 


Factorial Analysis of Wechsler Given to Elderly 403 


terpreted as a possible factor deficit. It is in- 
teresting in this regard that the mean V score 
of the elderly group did approach a mean of 10 
(Table 6). This is in accord with previous 
findings that vocabulary size is relatively un- 
changed in later life [5]. The senile group 
showed a loss compared with the elderly in all 
factors. The biggest relative loss was in C, 


(Table 6). 
Table 6 


Mean Factor Scores of the Elderly and 
Senile Groups 


V C, M 

Mean ¢ Mean ¢@ Mean ¢ 

Elderly N=—50 98 2.3 6.6 2.3 6.7 2.4 
Patients 

N 31 64 2.9 3.6 2.4 4.6 2.9 


Discussion 


The factors identified in the present study 
appear to be equivalent in several instances to 
those found by Balinsky in his analysis of the 
scale [1]. His G factor obtained in an orthog- 
onal solution is equivalent to the first primary 
of the present study in the orthogonal solution. 
He comments on the fact that he found no 
clear verbal factor in his oldest group, age 
50 to 59 years. It is suggested that the reason 
the verbal factor did not appear was that it 
was so highly correlated with a second-order 
factor that it disappeared in his method of ro- 
tation. In the present study the verbal factor, 
V, appeared clearly only after an oblique ro- 
tation. His G thus consists of V and a second- 
order factor. It does not seem likely that his G 
is really a general intellective factor since not 
all of the primary factors of the present study 
were correlated. Hence a general second-order 
factor would not be obtained. The perform- 
ance factor of Balinsky appears equivalent to 
the closure factor of the present study. There 
is also an element of similarity between the 
Memory factor of the present study and his 
reasoning factor. To permit a more detailed 
comparison of the factors it would be necessary 
to maximize the zero loadings in Balinsky’s 
study by means of oblique rotations. The 
second factor of Goldfarb [6] was positively 
related to age and had its highest loadings 


on the Vocabulary and Information tests. 
Without a doubt this factor is the / of the 
present study. Goldfarb thought this factor 
represented wisdom or experience; such an 
interpretation is homologous with the present 
one, i. ¢., stored verbal meanings. 

The present results would suggest that in- 
dividual differences in stored information are 
the largest single component in determining 
individual differences in test scores on the 
Wechsler tests. In young adults a high score 
on the W-B would presumably reflect high 
abilities in factors used in extracting informa- 
tion from an adequate stream of information 
surrounding the individual. The score might 
thus be used to give a good estimate of the 
probabilities of the individual’s accumulating 
further information. In individuals with brain 
damage a high V score would indicate that the 
individual had had the ability to extract and 
store information; however, at the time of 
testing he may have a considerable reduction 
in the abilities used to acquire the information. 

Intelligence tests such as the Wechsler- 
Bellevue seem to measure some diffuse com- 
bination of (a) the mass of previously learned 
or stored information and (4) abilities to ac- 
quire information. Success on a test item 
would seem to be determined by whether a 
similar bit of information has previously been 
presented to the individual, by his capacity to 
store it, and by his ability to associate a cur- 
rently presented bit of information with rele- 
vant stored information and pertinent infor- 
mation in the context of the item. As an in- 
dividual grows older his environment changes 
in the relative frequency of occurrence of cer- 
tain types of information. In addition to the age 
changes in the frequency with which certain in- 
formation occurs or is reinforced, there is the 
possibility that the individual suffers a dim- 
inution of abilities. He may not learn as 
rapidly or associate a presented bit of informa- 
tion with his stored information. One of the 
goals of studies of aging of the intellect is to 
be able to express the rate of change in the 
various abilities with age, and the rate of 
change in stored information, as partial deriv- 
atives of some function of test scores. It seems 
reasonable that the age of optimum learning 
and the age of maximum stored information 
are not coincident and do not correspond to 
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the age of maximum scores on our intelligence 
tests. Much more research is needed before we 
can make these relations explicit. 


The intercorrelations of the primaries were 
not factored in the present study because four 
factors would not permit the extraction of 
more than one second-order factor. The high 
intercorrelation between the primaries V and 
C, suggests to the writer that these factors 
share a common background of stored infor- 
mation or experience. The best representation 
of V’ is the vocabulary test which more than any 
other task requires simple verbatim recall of 
stored information. Jones found a second-order 
factor in the Stanford-Binet he tentatively 
called “ability of subjects to profit from scho- 
lastic experience” [7, p. 11]. This second- 
order factor had its highest loadings in V 
tests, i. e., tests of “the process of supplying 
previously learned linguistic responses, prim- 
arily word meanings” [7, p. 4]. From obser- 
vations of aphasia it is clear that words may 
imply “motor meanings” as well as verbal 
ones. This gives the clue that information is 
stored in several manners and a high corre- 
lation of the primary factors may reflect the 
fact that an individual with a large number 
of stored verbally expressible meanings tends 
to have a large number of nonverbal meanings 
as well. 

Generally, vocabulary size is stable in later 
life [5]. Since V is a large component in the 
Wechsler IQ, the ability to acquire informa- 
tion may be much less in the senile patients 
measured in this study than their total scores 
would suggest. In a review of the literature 
on the W-B tests [8], it appears that tests 
high in VY show little change with therapy. If 
the individual scores on the W-B are affected 
at all by therapy, the response lies in the 
largely nonverbal tests. Furthermore, even in 
the case of neurological disease, V is relatively 
little changed, whereas if any change occurs it 
occurs in tests low in V. This is certainly com- 
patible with the present interpretation of the 
nature of the W-B tests. In aging, as in other 
problems touching upon intelligence, the in- 
vestigative task is to describe the domain of 
stored information and the domain of dynamic 
abilities. By appropriate analysis one should 
be able to detect transient and permanent gains 
and losses in intellective abilities. 


James E. Birren 


Summary and Conclusions 


1. A factor analysis was made of the 
Wechsler-Bellevue Scale. The subtest inter- 
correlations were obtained from the scores of 
99 subjects between the ages of 60 and 74 
years inclusive. Four factors were extracted 
from the correlation matrix. After rotation to 
simple structure these factors were identified 
as ’, Verbal Comprehension ; C,, Closure; M, 
Rote Memory; and the last factor was tenta- 
tively identified as 7, Induction. 

2. A choice appeared in the treatment of the 
first factor, V. If this factor is left in an ortho- 
gonal position it appears as a general primary 
factor found in previous studies of the test in 
somewhat younger subjects. If, however, this 
factor is rotated to an oblique position, it makes 
a very acute angle with the second primary, 
C,, resulting in a high correlation between V 
and C,. These results are interpreted as in- 
dicating a large second-order factor consisting 
of stored information or experience which is 
correlated with both VY and C,. Because of 
the high correlation that would obtain be- 
tween /’ and the second-order factor, it seems 
reasonable that the distinction between V and 
a possible second-order factor would be am- 
biguous in some studies. However, the inter- 
pretation favored here is that there is a large 
factor of stored information of which V is one 
aspect, but a large one, i. e., V is stored verbal 
meanings. 

3. An attempt was made to secure factor 
scores for V, C,, and M by combining tests 
high in these factors. The V score was obtained 
by averaging the Information, Comprehension, 
and Vocabulary tests; the C, score by averag- 
ing the Picture Arrangement, Picture Comple- 
tion, and Object Assembly tests; and the M 
score by averaging the Digit Span, Arithmetic, 
and Digit Symbol tests. Little change was 
noted in V with age. A group of 31 senile 
patients, however, showed a loss in V. The 
greatest relative loss in the senile group oc- 
curred in C. 

4. The close association of test performance 
in the elderly with stored information suggests 
that to a greater extent than in young subjects 
their performance is determined by what they 
already know rather than by what new infor- 
mation they can elicit from the test situation. 
With senile brain damage there is not only a 
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further reduction in the ability to extract 
new information but there is also a large loss 
in stored information as well. 

5. It is apparent that the W-B does not 
include tests of many of the known primary 
mental abilities. For this reason it is of limited 
use in attempting to describe age changes in 
the intellect. As a test of practical length, how- 
ever, it can be used successfully with individ- 
uals. It would seem desirable, though, to in- 
clude tests of factors not represented in the 
present eleven tests perhaps at the expense of 
dropping several of the present tests where 
they are highly correlated. 


Received February 8, 1952. 
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Centile Scores for the#Wechsler- Bellevue Intelli- 
gence Scale ona University Counseling Center Group 


Reed M. Merrill and Louise B. Heathers 


University of Washington Counseling Center 


The Wechsler-Bellevue Intelligence Scale 
is one of the most popular measures of adult 
intelligence in the field of testing. The de- 
velopment of counseling centers in universities 
and of training facilities for graduate students 
in clinical psychology has probably increased 
its use in academic counseling situations. A 
series of papers is planned to provide further 
information about the test relative to a uni- 
versity population since the present informa- 
tion in this area is fairly limited [4, 5]. The 
purpose of the present paper is to present cen- 
tile scores for the W-B based on a university 
counseling center population. Such informa- 
tion should increase the usefulness of the test 
to counselors. 

The counseling center at the University of 
Washington, a large state university, combines 
the functions of a university student counsel- 
ing center and a veterans guidance center. 
Though some students come to the center 
primarily for personal counseling, the majority 
come for vocational counseling, i. e., for help 
in formulating their vocational plans. The 
client population is very predominantly male; 
as would be expected, lower division students 
are seen more frequently than upper division 
or graduate students. 

With the exception of the various editions 
of the ACF, the W-B is used more frequent- 
ly than any other test of mental ability at this 
counseling center. The assigning of this test 
in preference to others is determined by the 
needs of individual clients and the needs of 
practicum students. Since test batteries for 
individual clients were planned by 16 different 
counselors during the period covered by this 
study and since all counselors were urged to 
use this test whenever possible to provide ex- 
perience for practicum students, it is improb- 
able that there was a systematic bias in the 


selection of the clients to whom the test was 
given. 

Method. The group for this study consisted 
of all the male student clients who were tested 
at the center on the W-B between September 
1, 1948 and August 31, 1950 with the excep- 
tion of 17 Japanese students who were con- 
sidered atypical because of their bilingual back- 
grounds. Of the 450 students given the test, 
257 were veterans and 193 were nonveterans. 
As would be expected for the period of study, 
the veteran and nonveteran student client 
groups differed significantly in age and ed- 
ucational level; the veteran group was signifi- 
cantly older and more varied in age than the 
nonveteran group. In addition there were sig- 
nificantly more upper division students in the 
veteran than in the nonveteran group; the ¢ 
ratio between the percentage of lower division 
students and the percentage of upper division 
and graduate students was 2.60 which is sig- 
nificant above the 1% level. Tables 1 and 2 
present these data. 


Table 1 


Age Distribution of Veteran and Nonveteran 
Counseling Center Clients 








Range 








Group Mdn. Q 
Veterans (N—=257) 24.02 2.53 19-48 
Nonvets (N=—=193) 20.06 1.60 17-44 
CR 8.08 >.01 4.23>.01 
Table 2 


Educational Class Distribution of Veteran and 
Nonveteran Counseling Center Clients 











G Fresh. Soph. Junior Senior Grad. 
_— N% N% N% N% N% 
Veterans 
(N=257) 86 33 61 24 55 21 3313 22 9 
Nonvets 
(N—193) 69 36 66 34 3719 147 7 4 





= 12.18, P> .02. 
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The tests were administered by ten graduate 
students in clinical psychology; eight of these 
had received their training on the W-B from 
the junior author. In addition, tests given by 
students at the center are regularly rescored. 
Wechsler’s [7] administration and scoring 
standards are adhered to rigidly. 


Table 3 


Means and Standard Deviations of W-B IQ Scores 
for Veteran and Nonveteran Counseling 
Center Clients 


Veterans Nonveterans 





IO Score (N = 257) (N = 193) CR 
Verbal Scale 
M 118.60 118.99 47 
o. 55 62 
SD 8.75 8.64 59 
0. 39 44 
Performance Scale 
M 119.83 118.30 1.65 
e.. 59 72 
SD 9.43 10.04 .92 
Gy 42 51 
I Scale 
M 121.03 121.06 .04 
C.. 52 63 
SD 8.31 8.75 58 


o,, 37 AS 





Results. As may be seen in Table 3, in spite 
of these age and educational differences, the 
two groups do not differ significantly in either 
means or standard deviations on any of the 
three scales of the W-B. Contrary to Estes’ 
[3] data, this group, like Anderson’s [1], does 
not do better on the Verbal than on the Per- 
formance Scale. 


Wechsler’s [7, p. 126] data show that 
superior Ss, those with W-B IQ’s of 120 and 
above, will generally score higher on the 
Verbal than on the Performance Scale. How- 
ever, if only those Ss with Full Scale 1Q’s of 
120 and above (272 or 60% of the Ss) are 
considered, the relationship indicated above 
still holds. Verbal IQ’s were higher than Per- 
formance IQ’s in only 44.5% of these Ss and 
were lower than Performance IQ’s in 50.7%. 
Wechsler’s comparable data are 77.4% and 
21.0%, respectively. For this group the median 
difference between Verbal and Performance 


I1Q’s was 6.1 with a Q of 5.0; Wechsler’s 
corresponding data are 11.1 and 4.7. The 
group used here is much larger than that of 
Wechsler and is primarily from the western 
rather than the eastern section of the United 
States. In Rapaport’s [6] data, 21 of his 54 
patrolmen had Full Scale 1Q’s of 120 or 
above. Only 54.4% of these Ss had Verbal 
IQ’s above Performance 1Q’s and 42.9% had 
Verbal 1Q’s below Performance 1Q’s. Thus 
the results of this study and of Rapaport’s con- 
trol group do not support the hypothesis that 
Ss with Full Scale W-B I1Q’s of 120 and above 
will generally have higher Verbal than Per- 
formance IQ's. 


Table 4 
Centiles for W-B IQ Scores for Counseling Center 
Clients (N = 450) and for Raw Scores on the 
1949 ACE for Entering Freshmen at the 
University of Washington 











W-B IQ Scores 1949 ACE Raw Scores 

Vv P Full Q L Total 

Centile Seale Scale Scale Scale Scale Seale 
99 137 138 138 65 106 164 

95 132 133 134 57 97 148-149 

90 130 130 131 54 90 138-139 
85 128 129 129 61 86 134 
80 ad 128 128 50 82 129 
76 125 126 12 48 79 125 
70 124 125 125 46 77 122 
65 122 124 125 45 74 119 
60 121 122 124 44 72 116 
55 119 121 123 43 70 113 
60 119 119 122 42 68 110 
45 118 118 121 41 66 107 
40 118 118 120 39 64 104 
35 117 116 118 38 62 102 
30 115 116 118 37 60 yy 
25 114 113 116 86 58 96 
20 113 lll 114 35 56 93 
15 lll 110 113 32 53 89 
10 107 106 109 29 50 83-8. 
5 104 102 106 24 45 73-756 


1 94-97 95 100-101 13-16 36-38 55-62 





Since the two groups of male student coun- 
seling center clients performed so similarly on 
the test, they were combined to obtain the 
centile scores. Wechsler-Bellevue 1Q scores 
corresponding, approximately, to various cen- 
tile scores are given in Table 4.2 The Table 
also includes the raw scores on the 1949 edition 
of the ACE which correspond to given centile 
scores for entering freshmen at this university. 


‘Precise centile tables are available on request. 
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Since the ACE is used by many universities 
as a measure of scholastic aptitude, the in- 
clusion of the ACE data will enable counsel- 
ors to evaluate the appropriateness of these 
W-B centiles for their student populations. 


Comparison with a Nonclient Male Student 
Group. The above data would be more useful 
if one knew how well a counseling center 
population represents a total university popu- 
lation. Since many students, particularly non- 
veterans, seek a counseling center because of 
academic difficulties, a counseling center popu- 
lation may be somewhat less capable than the 
total university population. The data given 
below are presented in an attempt to evaluate 
how representative the client group was of the 
total university population. 

Only 251 of the 450 Ss in the counseling 
center group had taken some edition of the 
ACE as freshmen at this university. Chi 
squares? between obtained and expected decile 
distributions on local ACE norms gave a p of 
approximately .50 on the Total ACE distri- 
bution. This shows that the counseling center 
group is representative of the entering fresh- 
men group on the Total ACF. 

To get a relatively comparable nonclient 
student group, all W-B’s given over a period 
of years by graduate students in training in a 
mental testing course were reviewed. This re- 
view resulted in the collection of 280 records 
of male students. These students were, for the 
most part, members of psychology classes who 
volunteered to serve as subjects for the grad- 
uate students. Although the age and educa- 
tional distributions of this group are quite 
similar to those of the entire university for the 
period covered by the study—/’s from chi 
squares were approximately .70 and .90, re- 
spectively—this is certainly not a random 
sample of the university population. In recent 
years names have been omitted from these 
tests. Hence local entering freshmen ACE 
centiles were available on only about a third 
of the volunteer group (99 or 35% of the 
Ss). The approximate » value for the chi 
square between obtained and expected Total 
ACE entering freshmen decile distributions 
for this limited sample was .20. 


2Df’s equal to N—3 were used in testing the sig- 
nificance of the obtained chi squares. 


Reed M. Merrill and Louise B 
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Table 5 


W-B IQ Scores for Counseling Center Clients 
(N=—=450) and for Volunteers (N=—=280) 


IQ Score Clients Volunteers CR 
Verbal Scale 
VM 118.78 121.66 4.30 
0 41 53 
SD 8.70 8.85 32 
0 .29 37 
Performance Scale 
Vl 119.17 119.65 69 
o. 46 53 
SD 9.69 8.91 1.56 
o.. .32 38 
Full Scale 
M 121.03 122.74 2.80 
a 40 A6 
SD 8.49 7.7 1.67 
c.. .28 33 


As may be seen from Table 5 this volunteer, 
nonclient group does significantly better, statis- 
tically, than the counseling center group on 
both the Verbal and the Full Scales of the W-B 
and, unlike the counseling center group, does 
significantly better, statistically, on the Verbal 
than on the Performance Scale. The ¢ ratio 
between the Verbal and Performance IQ’s for 
the volunteer group is 3.47 which is significant 
above the 1% level. If only those Ss with I1Q’s 
of 120 and above are used (186 or 66% of 
the Ss), only 56.4% of the Ss have Verbal 
IQ’s higher than Performance IQ’s and 
39.8% have Verbal IQ’s lower than Perform- 
ance 1Q’s. The median difference between 
Verbal and Performance IQ’s was 7.6 with a 
O of 5.0. These percentages are more similar 
to those obtained on the client group than to 
those reported by Wechsler. 

Discussion. It is felt that the centiles re- 
ported here will be found adequate for use in 
university counseling situations. Although the 
differences between the mean Verbal and Full 
Scale IQ’s for the counseling and volunteer 
groups are statistically significant, the ob- 
tained differences—2.88 and 1.71 IQ points, 
respectively—have little practical significance 
in view of the size of the standard error of 
measurement of the test. The large number 
of cases involved contributes to statistical sig- 
nificance but the “practical” significance is 
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not challenged [2]. Moreover, it is believed 
that the counseling center group deviates less 
from the total university population than the 
volunteer group results would suggest. There 
were proportionately more upper division and 
graduate students in the volunteer than in the 
counseling center group. Just as it seems 
plausible that students seeking a counseling 
center may be slightly less able than students 
in general, so is it equally plausible that stu- 
dents who volunteer to take intelligence tests 
will, as a group, tend to be slightly more able 
than students in general. 

Summary. Centile scores for W-B Verbal, 
Performance, and Full Scale IQ’s are pre- 
sented based on 450 male student counseling 
center clients at a large state university. Local 
centiles on the 1949 ACE are also presented 
to increase the usefulness of the W-B centiles. 
The performance of the counseling center 
group is compared with that of 280 nonclient 
male students who volunteered to serve as 
subjects for a class in mental testing. For prac- 
tical purposes, the centiles reported will re- 
flect fairly adequately an individual university 


student’s performance on the W-B relative to 
that of other male students. 


Received April 8, 1952. 
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Some Further Evidence on the Validity of the WISC 


Paul Mussen, Sanford Dean, and Margery Rosenberg 


The Ohio State University 


Almost all the reported studies of the valid- 
ity of the Wechsler Intelligence Scale for 
Children (WISC) have been concerned only 
with its relationship to other measures of in- 
telligence [1, 2, 3]. The present investigation 
presents some additional data on the validity 
of the WISC, using three traditional criteria 
of validity: (a) The Stanford-Binet, (5) 
achievement as measured by standardized 
achievement tests, and (c) teachers’ ratings of 
intelligence. 


In this study, the WISC was administered 
to 62 children, ages 6-0 to 13-1, attending 
grades 1 to 7 at the Ohio State University 
School. The 39 children who were above the 
first grade had been given the Revised Stan- 
ford-Binet at least a year previously. Metro- 
politan Reading and Arithmetic Achievement 
Tests had been administered to the pupils in 
grades 2 to 4, while Stanford Reading and 
Arithmetic Achievement Tests were given in 
the upper grades. No achievement test scores 
were available for our 23 subjects in the first 
grade. However, a teacher’s rating of intelli- 
gence on the Haggerty-Olson-Wickman Rat- 
ing Scale of Intelligence was obtained for each 
subject. 


Table 1 
Intercorrelations, Means, and Sigmas of S—B 
and WISC IO for 39 Children 














S-B W-F W-V W-P 
WISC, Full Scale 85 
WISC, Verbal Scale 83 .94 
WISC, Performance 
Scale .72 89 .73 
Mean 120.3 106.6 109.0 107.7 
Sigma 27.0 16.7 18.0 14.7 





Table 1 presents the intercorrelations be- 
tween the S-B and the WISC Full Scale, 
Verbal, and Performance IQ’s as well as the 
means and sigmas for all these scales. ‘These 


correlations are consistent with those found by 
previous investigators [1, 2, 3]. Both the 
WISC Verbal and Performance IQ’s correlate 
highly with the WISC Full Scale IQ and 
there is an appreciable relationship between 
the S-B IQ and each of the three WISC IQ’s. 
To a considerable degree, all four scales seem 
to be measuring the same factor or factors. 

The mean S-B IQ was significantly higher 
than any of the three mean WISC IQ’s. 
These differences, which have been noted in an 
earlier study [3], were particularly marked 
in this investigation because our population 
was a highly selected one and the mean S-B 
IQ was considerably elevated by several scores 
over 154, the upper limit of the WISC 
scores. 

Our second and third criteria of validity 
were teachers’ ratings of intelligence and 
school achievement as measured by Metropoli- 
tan and Stanford Reading and Arithmetic 
Achievement Tests. Achievement test scores 
were converted into equivalent age scores and 
these into reading and arithmetic quotients. 
The correlations between these quotients and 
S-B and WISC IQ’s, together with the corre- 
lations between the intelligence scales and 
teachers’ ratings, are presented in Table 2. 


Table 2 
Correlations of S-B and WISC IQ’s with Teachers’ 
Ratings of Intelligence and Reading and 
Arithmetic Quotients from Two 
Achievement Tests 











Teach- Metro- Metro- Stan- Stan- 
ers politan politan ford ford 
Rat- Arith- Read- Arith- Read- 
ings metic ing metic ing 
(N=62) (N=21) (N=21) (N=18) (N=18) 
WISC-F 68 81 -75 44 .69 
WISC-V 64 74 62 47 -78 
WISC-P .58 -74 -76 29 57 
S-B -76* -76 -59 45 65 





*Based on 39 cases. 
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These data make it clear that, while the 
S-B correlates more highly with teachers’ esti- 
mates of intelligence, the WISC Full Scale 
and Verbal IQ’s predict objectively measured 
school achievement slightly more accurately 
than the S-B does. As might have been ex- 
pected, the WISC Performance IQ does not 
correlate as well as the other intelligence 
measures with either school achievement or 
teachers’ estimates of intelligence, since both of 
these are more dependent on verbal than on 
performance ability. Generally speaking, these 
correlations are quite similar to those found 
by Frandsen and Higginson [1] and all the 
correlations are relatively high. It may be con- 
cluded that all four scales are valid predictors 
of school achievement as measured here. 


Summary and Conclusions 


The WISC was administered to a group of 
62 children in grades 1 to 7. Teachers’ ratings 
of intelligence were obtained for all subjects, 
and scores on the Revised Stanford-Binet In- 
telligence Scale and standard achievement 
tests were available for 39 of them. 


The intercorrelations among the intelligence 
measures, achievement tests, and teachers’ 
ratings reveal that: (a) To a considerable de- 
gree the WISC Full Scale, Verbal, and Per- 
formance IQ’s and S-B IQ measure the same 
factor or factors; (4) All three WISC IQ’s 
and the S-B IQ seem to be valid predictors 
of school achievement as measured by standard 
achievement tests; (¢) All four measures of 
intelligence are highly correlated with teach- 
ers’ estimates of intelligence. 


Received February 25, 1952. 
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Alxeander, Franz, & Ross, Helen. (Eds.) Dynamic 
psychiatry. Chicago: Univer. of Chicago Press, 


1952. Pp. xii + 578. $10.00. 


This treatise by Alexander and fourteen col- 
laborators marks an important step in the coming of 
age of dynamic psychiatry. The book’s framework 
is basically psychoanalytic, but it differs from many 
other analytic volumes in drawing on the older con- 
tributions without oily eulogies and in departing 
from them, when the evidence so requires, without 
defensive contentiousness. In a word, it is mature. 
It is also uncommonly well integrated for a col- 
laborative enterprise. Among the many outstanding 
chapters, there seems to be particular merit in 
Alexander’s clear statement of fundamental concepts, 
in Maurice Levine’s on principles of treatment, and 
in Henry W. Brosin’s three extensively documented 
summaries of the literature on the contributions of 
psychoanalysis to organic disorders, to psychoses, 
and to current thought. Two nonmedical contribu- 
tors, Margaret Mead and David Shakow, have 
sections on anthropology and on clinical psychology. 
Although intended mainly as a book for psychia- 
trists, the volume has much to offer as a text and 


reference for education in clinical psychology. — 
L. F. S. 


Benedek, Therese. Studies in psychosomatic medi- 
cine: Psychosexual functions in women. New 
York: Ronald Press, 1952. Pp. x + 435. $10.00. 


Eleven of the fifteen chapters of this book reprint 
a research monograph, originally published in 
1942, on the relationships between the ovulative 
cycle and emotional changes in a group of 15 
women. Sexual drive and related attitudes were 
determined by psychoanalytic studies; the physio- 
logical data were obtained independently. The 
correlations between the physiological cycle andthe 
psychological processes are clearly evident, and the 
author concludes that “the sexual drive repeats a 
developmental integration in each cycle,” paralleling 
the childhood developmental sequence. Tl] e remain- 
ing chapters of the book are essays on other psy- 
chosomatic aspects of sex in women. The chapter 
which views the climacterium as a positive develop- 





Note: The reviews are prepared by the Editor 
and Associate Editors, who may be identified by 
their initials. 
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mental phase, rather than as an aspect of aging, is 
especially interesting.—L. F. S. 
Bryan, Alice I. The public librarian. New York: 


Columbia Univer. Press, 1952. Pp. xxvii + 474. 
$6.00. 


This is a report of the Public Library Inquiry, 
conducted by the Social Science Research Council 
under a grant from tne Carnegie Corporation. The 
volume covers an analysis of current library per- 
sonnel and personnel practice, and a discussion of 
library schools. The basic data were obtained from 
a sample of 58 libraries, and from 2,395 librarians. 
In addition to analyses of personnel and adminis- 
trative practices in the libraries, the report includes 
results of the Guilford-Martin Inventory of Fac- 
tors of GAMIN, the Strong Vocational Interest 
Blank, and a 52-page personnel questionnaire filled 
out by the librarians. Librarians are found to be 
established norms in self-confidence and 
leadership, and their occupational in- 
terests are most similar to those of artists and writ- 
ers and (for the men) musicians. Major findings 
include the inadequacies of salaries and the inade- 
quacies and inequities of career patterns and of 
personnel management and policies.—A. R. 


below the 
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Buhler, 
Frances 


Charlotte, Lefever, D. Welty, Kallstedt, 
E., & Peak, Horace M. Development of 
Rorschach score: supplementary mono- 
graph. Los Angeles: Rorschach Standardization 
Studies, 1127 North Sweetzer Ave., 1952. Pp. iv 
$2.50. 


the basic 


Four separate studies comprise this monograph, 
which is an outgrowth and extension of Rorschach 
Standardization Study I, published in 1948. The 
first study, by Lefever and Buhler, provides a check, 
based on an independent sample of 360 cases, on the 
scoring weights from which the Basic Rorschach 
Score was earlier derived. Agreement between the 
weights obtained from the original and present 
samples is high. The second study summarizes an 
investigation by Kallstedt on the Rorschachs of 66 
well-adjusted adolescents. Profiles of these adoles- 
cents are somewhat similar to those obtained by 
adults suffering from anxiety neurosis—a finding 
which seems to fit the general psychological knowl- 
edge of adolescence closely enough to justify apply- 
ing the standards obtained here to the Rorschachs 
of adolescents in general. The third section presents 
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Buhler’s summary of three group Rorschach studies, 
which point up certain systematic shifts in Rorschach 
scores obtained from group rather than individual 
administration. The final paper, by Peak, reports 
the results of a study of popular responses occurring 
in the records obtained for the 1948 Buhler stand- 
ardization study. Peak employs 33 per cent occur- 
rence as a criterion of popularity, and lists all re- 
sponses whose values meet or exceed that criterion. 
All four studies represent significant contributions to 
the task of Rorschach standardization —A. M. 


Bychowski, Gustav. Psychotherapy of psychoses. 
New York: Grune & Stratton, Inc. 1952. Pp. 
viii + 328. $5.75. 


Contributions to therapeutic techniques with psy- 
chotic patients are becoming increasingly frequent 
in current psychiatric literature. This volume aims 
at synthesizing present knowledge regarding psycho- 
therapy in psychosis; in reality, however, it fur- 
nishes a great deal of information concerning the 
development and phenomenology of psychosis as 
well. The first sixteen chapters, for example, are de- 
voted in large part to a description of schizophrenic 
disorder, particularly to the phenomena of regres- 
sion, changes in ego structure, defense mechanisms, 
and the development of delusional and hallucina- 
tory behavior. It is in the latter half of the book 
that such problems of therapeutic management as 
the handling of hostility, transference, and interpre- 
tation are considered. Except for a brief series of 
chapters on depression, primary emphasis is given 
to schizophrenia. The viewpoint of this presentation 
is predominantly psychoanalytic; mention is made 
not only of Bychowski’s own theoretical and practi- 
cal contributions to psychiatry from this point of 
view, but also of most of the important historical 
and contemporary analytic contributors to the under- 
standing of psychosis. The author’s style is so read- 
able, and his use of clinical case material so appro- 
priate and generous, that the reader feels almost as 
if he is receiving personal instruction in psycho- 
pathology and therapeutic technique from a sympa- 
thetic and experienced teacher—A. M. 


Greene, Edward B. Measurements of human be- 
havior. (Rev. Ed.) New York: Odyssey Press, 
1952. Pp. xlii + 790. $4.75. 


Like its earlier edition of 1941, this revision con- 
sists mainly of a compilation of descriptions of 
tests, most of them extracted uncritically from test 
manuals. It is up to date only in mentioning many 
recently pubiished tests. The data about the older 
tests tend to be unchanged from the first edition, 
even when relevant critical studies exist which 
should be cited. Two new chapters, on military tests 
and on personality theory, add little to the useful- 
ness of the book.—L. F. S. 


Lewis, Oscar. Life in a Mexican village: Tepoztlan 
restudied. Urbana, Ill.: Univer. of Illinois Press, 
1951. Pp. xxvii + 512. $7.50. 


The repetition of an anthropological study of a 
small community is a rare, perhaps a unique, event. 
Twenty years after Redfield’s notable pioneer in- 
vestigation of 1926, Tepoztlan was restudied with 
augmented resources. To some extent, the new 
work bears on the social changes which took place 
during the interval. Even more it is a methodo- 
logical contribution, a retest reliability study, which 
shows the effects upon observations and conclusions 
of having more informants, of using newer methods 
of analysis, and of holding a broader theoretical 
viewpoint. Much of the study is relevant to psy- 
chology, especially the observations on child rearing 
and on the personalities of the people. Rorschachs 
of 106 persons, interpreted by Theodora M. Abel 
and Renata A. Calabresi, seem consonant with the 
personality pictures obtained by other methods. 
Even without these special considerations, the inter- 
esting and handsomely illustrated volume can be 
recommended on its own merits. It is good read- 
ing—L. F. S. 


Lorand, Sandor. (Ed.) The yearbook of psychoan- 
alysis. Vol. VII, New York: International Uni- 
versities Press, Inc., 1952. Pp. 271. $7.50. 


Volume VII contains twenty selected articles from 
psychoanalytic periodicals published in 1950, and 
Parts II and III of a survey of literature on dream 
interpretations by Fliess. Two of the articles and, 
to the reviewer the most interesting, are really re- 
views of books—one, by Nunberg, of Freud's 
Outline of Psychoanalysis, and the other a prepubli- 
cation review of hitherto unpublished writings of 
Freud from 1887 to 1902. In the first of these re- 
views Nunberg says, “The Outline is of all his 
summarizations the simplest, clearest and yet most 
comprehensive.” The review itself can be described 
in the same terms. The bulk of the volume deals 
with what is probably a cross section of psychoan- 
alytic preoccupation at the present time: renewed 
emphasis on dream analysis; reorganization of older 
concepts in the light of new experience with 
patients; physiological relationships to emotional 
problems; psychoanalysis and social problems; and 
techniques of terminating analysis, to mention the 
major trends. For those who do not have ready 
access to the major psychoanalytic periodicals, this 
volume serves as an anthology of the past year’s 
analytic literature—M. K. 


Piaget, Jean. Play, dreams and imitation in child- 
hood. (Trans. by C. Gattegno & F. M. Hodgson 
New York: Norton, 1952. Pp. ix + 296. $5.00. 


Followers of Piaget’s provocative studies of child 
development will welcome this English translation 
of La Formation du Symbole chez l’Enfant. This 
book takes its place as the third of a series (includ- 
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ing La Naissance de I’ Intelligence chez l’ Enfant, and 
La Construction du Reel chez l’Enfant) based on 
detailed observations of the same three children 
from birth to early childhood. Three chapters deal 
with the stages in the development of imitation, 
from the systematic imitation of sounds to 
representative imitation. Four chapters describe 
the development and characteristics of play. Here 
Piaget’s now-familiar analysis of the structure and 
rules of children’s games is expanded to include a 
critical treatment of theories of play, and a stimu- 
lating discussion of unconscious symbolism in 
dreams as well as play. The final three chapters 
concern the evolution of cognitive representation 
from sensory-motor schemas to conceptual schemas. 
In dealing with behavior which is often subtle and 
elusive, Piaget is, as usual, the careful observer, 
meticulous recorder, and imaginative theorizer. The 
translators have done an excellent job in providing 
a glossary of unfamiliar terms and in making this 
a clear and readable volume.— A. M. 


Vernier, Claire Meyers. Projective drawings. New 
York: Grune & Stratton, 1952. Pp. vii + 168. 
$6.00, 


Proyective Drawings is a very attractive, spiral- 
bound publication built around perhaps a hundred 
full-page reproductions of human figure drawings. 
These are grouped in the major clinical categories, 
including the “normal.” Sixty different cases are 
represented, and each case except the “normals” 
contains a brief clinical statement and some com- 
ment about the distinguishing elements in the 
drawings. In spite of frequent warnings by the 
author about the tentative nature of the analyses 
of the drawings, it is quite likely that this publi- 
cation will be used as a manual for the clinical 
interpretation of drawings, since the material is 
concisely presented. This would be a misuse of the 
publication. It is by no means a research report; it 
is an attempt to present impressions gained from 
drawings of clinically diagnosed cases. Some of the 
impressions are clear and some are vague and 
overlapping. The “normal” group is the most con- 
fusing of all, since it presents many of the elements 
characteristic of disturbed groups. This report is a 
brave beginning, but only a beginning. Experienced 
psychologists will find it valuable. Others should 
handle the material gingerly —M. K. 


Whitehorn, John C. (Chm.) Psychiatry and medical 
education. Report of the 1951 Conference on Psy- 
chiatric Education. Washington: American Psy- 
chiatric Association, 1952. Pp. viii -+ 164. 


In a 1951 conference, a group of 82 psychia- 
trists, deans of medical schools, and allied specialists 
considered the place of psychiatry in general 
medical education. The report deals with five maia 
topics: the medical student as a person, the setting 
of medical education, the undergraduate curriculum 


in psychiatry, the administrative organization of 
psychiatric teaching, and community needs in re- 
lation to the general physician. The sections on the 
assets and liabilities of medical students as persons, 
and on the content and methods of psychiatric 
teaching are especially good. There is considerable 
common ground between the basic training seen as 
desirable by psychiatry and by clinical psychology. 
— L.F.S. 


Tests 


Lee, J. Murray, & Clark, Willis W. Lee-Clark 
Reading Readiness Test. 1951 Revision. Kgn. - 
gr. 1. 1 form (15) min. Test booklets ($1.50 per 
25) with manual, pp. 12; specimen set (35¢). 
Los Angeles: California Test Bureau, 1951. 


This test predicts readiness to learn to read from 
measures of letter and word discrimination and 
picture vocabulary. Data indicate adequate reli- 
ability and useful correlations with subsequent 
reading achievement. The norms are well expressed 
in terms of the probable per cent of failures to 
learn to read among children making each score. 
Although the test seems to have value, a criticism 
can be made of the 1951 “revision.” It is not a re- 
vision in the usual sense, but a reprinting of the 
test booklet and manual in a new format. The test 
content is unchanged from the preceding (1943) 
version. The manual states that “this revised 
edition. . .was standardized on 5,000 cases. . .,” but 
the table of norms is identical in every respect to 
that published in 1943. These facts seem to indicate 
that the so-called “revision” was designed to serve 
commercial rather than professional objectives.— 
L, F. S. 


Leiter, Russell G. Manual for the 1948 revision of 
the Leiter International Performance Scale, Part 
II. Washington: Psychological Service Center 
Press, 1952. Pp. 85. $2.00. 


This manual consists of revised directions for ad- 
ministering and scoring the 1948 revision of the 
Scale. There are no data concerning its standardi- 
zation or validity.—L. F. S§. 


Sullivan, Elizabeth T., Clark, Willis W., & Tiegs, 
Ernest W. California Test of Mental Maturity. 
1951 Edition. 5 levels: grades kgn-1, 1-3, 4-8, 
7-10, 9-adult. 1 form at each level. (50) to (92) 
min. according to level. Test booklets ($2.75 per 
25) with manual, pp. 20-28, key, record sheet. 
IBM answer sheet (4¢); Scoreze answer sheet 
(7¢); specimen set, 1 level (35¢). Los Angeles, 
California Test Bureau, 1951. 


In their first revision since 1946, the California 
tests are presented in attractive new format, but 
with only minor changes in content. The screening 
tests for visual and auditory acuity and for motor 
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coordination have been eliminated, and the spatial 
relationships section has been reduced from three 
subtests to two by the omission of the mazes. Pre- 
vious reviews have already commented on the vir- 
tues and shortcomings of this series of tests (J. 
consult, Psychol., 1947, 11, 156; 1948, 12, 126; 1951, 
15, 516). The main merits remain the same: the 
wide coverage of ingeniously sampled intellectual 
functions, the emphasis on nonverbal materials, and 
the homogeneous coverage of the range from kinder- 
garten to superior adult. The manual of the new 
edition continues to give only inadequate data on 
validity, but the information concerning the norms 
is improved. The test will probably continue in 
wide use, and seems as serviceable as any other 
group measure of mental ability.—L. F. S. 


Vorhaus, Pauline G. TAT Summary Record Blank. 
Record blank ($1.00 per 25) with manual, pp. 8; 
specimen set (20¢). Yonkers, N. Y.: World Book 
Co., 1952. 


The TAT Summary Record will be an aid to 
those who like to analyze the more formal aspects 
of a TAT record. In compact and convenient form 
it provides a method for recording 18 characteris- 
tics of protocols, under four headings: ways of han- 
dling materials, emotional aspects, central figure— 
environment, and central figure—intrapersonal sit- 
uation. One single-page blank provides space for 
the analysis of the responses to 12 cards. The man- 
ual defines the categories adequately, and presup- 
poses a general acquaintance with the TAT. It has 
an appreciative foreword by Henry A. Murray.— 
L. F. S. 


Books Received 


Berrien, F. K. Practical psychology. New York: 
Macmillan, 1952. Pp. xv + 640. $5.00. 

Chesser, Eustace. Cruelty to children. New York: 
Philosophical Library, 1952. Pp. 159. $3.75. 
Fairchild, Johnson E, (Ed.) Women, society @ sex. 
New York: Sheridan House, 1952. Pp. 255. $4.00. 
Hyndman, Olan R. The origin of life and the ewo- 
lution of living things. New York: Philosophical 

Library, 1952. Pp. xxi + 648. $8.75. 

Kris, Ernst. Psychoanalytic explorations in art. 
New York: International Universities Press, 1952. 
Pp. 358. $7.50. 

Maier, Norman R. F. Principles of human rela- 
tions. New York: Wiley, 1952. Pp. x + 474. 
$6.00. 

National Institute of Industrial Psychology. Joint 
consultation in British industry. London: Staples 
Press, 1952. Pp. 276. 21 s. 

Negri, Vitali. Your complexes and you. Los An- 
geles: American Book Institute, 1952. Pp. 299. 
Ralli, Elaine P. (Ed.) Adrenal cortex. Transac- 
tions of the Third Conference, November 15-16, 
1951. New York: Josiah Macy, Jr. Foundation, 

1952. Pp. 204. $3.25. 

Runes, Dagobert D. Of God, the Dewil and the 
Jews. New York: Philosophical Library, 1952. 
Pp. 186. $3.00. 

Stagner, Ross, & Karwoski, T. F. Psychology. New 
York: McGraw-Hill, 1952. Pp. xiii + 582. $5.00. 

Thurstone, L. L. (Ed.) Applications of psychology. 
Essays in honor of Walter V. Bingham. New 
York: Harper, 1952. Pp. x + 209. $3.00. 

Wolff, Charlotte. The hand in psychological diag- 
nosis. New York: Philosophical Library, 1952. 
Pp. xv + 218. $7.50. 











RORSCHACH STANDARDIZATION STUDIES, $4.95. 


DEVELOPMENT OF THE BASIC RORSCHACH SCORE WITH MANUAL OF DIRECTIONS 


(Mimeographed) Vol. 1. — 3rd edition — 1951 


BY CHARLOTTE BUHLER, KARL BUHLER, D. WELTY LEFEVER 
RORSCHACH STANDARDIZATION STUDIES, $2.50 
(MIMEOGRAPHED) 


DEVELOPMENT OF THE BASIC RORSCHACH SCORE—SUPPLEMENTARY MONOGRAPH, 1952. 
BY CHARLOTTE BUHLER, D. WELTY LEFEVER, FRANCES E. FALLSTEDT, HORACE M. PEAK 


AVAILABLE FROM 


RORSCHACH STANDARDIZATION STUDIES 


1127 N. Sweetser Ave., Los Angeles ‘4, California 


The Journal of Nervous and Mental Diseases, Feb. 
1960 (Rev. by Piotroski) says: “Everyone actually 
engaged in Rorechach practice will te acquaint 

himself with this contribution.” 
The American Journal of Orthopsychiatry (A. G. 
Woltman) says: ..... “A mew concept of the ‘Basic 
Rorschach Score’ is the keystone of the present study. 
and scering, sample cases, This score is derived from a list of 102 diagnostic sign 
tistical proceedure eroployed, and reliability patterns checked for their occurrence in a group of 
and validity.” 578 cases. . . Critical ratios were derived from the 
The American Journal of Psychotherapy, distribution of each of these signs for cach group of 
(Rev. by Herman Molish) the subjects te determine their ‘positive’ and ‘negative’ 


contributions of this stady is the m= association; Le. whether they occurred in normals 


sents for the (‘positive’) or im the groups with personality diffi- 
and ego-integration. culties (‘negative’). Weights were assigned to exch of 
the diagnostic signs. .. . An important step in this 
research is the formulation of four levels of person- 
ality integration. These levels are determined sta- 
tistically by the distribution of the Basic Rorschach 
Sevres in the various clinical groups. The rationale 


integ 
stresses the need of functional classification rather 
than the traditional clinical categories. These levels 
are: adequacy, conflict, impairment, and reality loss.” 
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This important volume offers a clear and straight: 
Catholic position on the value and place of psych 
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By Newait C. Kepnaarr, Purdue Universit 


A treatment of the employment interview as a basi 
and placing workers, this text considers the inte 
a8 an integrated part of the entire selection pro 
while facts themselves are relatively easy to obt 
what facts are needed, how to realize when th 
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By H. W. Karn and B. von Hauser Git 
McGraw-Hill Publications in Psychology. 4 
bound) $3.50 


A collection of 53 representative articles which p 
solution of some of the more pressing psycholog 

Each article is an integrated presentation in itse 
to other articles in the book, although the mat: 
consideration of all the material in the book w 

overlapping of content. 
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